1 CCCAQQQCGC CGTAGGCGGT GCATCCCGTT CGCGCCTGGG GCTGTGGTCT 
51 TCCCGCGCCT GAGGCGGCGG GGGCAGGAGC TGAGGGGAGT TGTAGGGAAC 
101 TGAGGGGAGC TGCTGTGTCC CCOGCCTCCT CCTCCCCATT TCCGGGCTCC 
151 CGGGACCATG TCCGCGCTGG CGGGTGAAGA TGTCTGGAQG TGTCCAGGCT 
201 GTGGGGACCA CATTGCTCCA AGCCAGATAT GGTACAGGAC TGTCAACGAA 
251 ACCTGGCACG GCTCTTCCTT CCGGTGAAAG TGATGCGCAG CCTGGACCAC 
301 CCCAATGTGC TCAAGTTCAT TGGTGTGCTG TACAAQGATA AGAAGCTGAA 
351 CCTGCTGACA GAGTACATTG AGGGGGGCAC ACTGAAGGAC TTTCTGCGCA 
401 GTATGGATCC GTTCCCCTGG CAGCAGAAGG TCAGGTTTGC CAAAGGAATC 
451 GCCTCCGGAA TGGACAAGAC TGTGGTGGTG GCAGACTTTG GGCTGTCACG 
501 GCTCATAGTG GAAGAGAGGA AAAGGGCCCC CATGGAGAAG GCCACCACCA 
551 AGAAACGCAC CTTGCGCAAG AACGACCGCA AGAAGCGCTA CACGGTGGTG 
601 GGAAACCCCT ACTGGATGGC CCCTGAGATG CTCAACGGAA AGAGCTATGA 
651 TGAGACGGTG GATATCTTCT CCTTTGGGAT CGTTCTCTGT GAGATCATTG 
701 GGCAGGTGTA TGCAGATCCT GACTGCCTTC CCOGAACACT GGACTTTCGC 
751 CTCAACGTGA AGCTTTTCTG GGAGAAGTTT GTTCCCACAG ATPGTCCCCC 
801 GGCCTTCTTC CCGCTGGCCG CCATCTGCTG CAGACTGGAG CCTGAGAGCA 
851 GACCAGCATT CTCGAAATTG GAGGACTCCT TTGAGGCCCT CTCCCTGTAC 
901 CTGGGGGAGC TGGGCATCCC GCTGCCTGCA GAGCTGGAGG AGTTGGACCA 
951 CACTGTGAGC ATGCAGTACG GCCTGACCCG GGACTCACCT CCCTAGCCCT 
1001 GGCCCAGCCC CCTGCAGGGG GGTGTTCTAC AGCCAGCATT GCCCCTCTGT 
1051 GCCCCATTCC TGCTGTGAGC AGGGCCGTCC GGGCTTCCTG TGGATTGGCG 
1101 GAATGTTTAG AAGCAGAACA AACCATTCCT ATTACCTCCC CAGGAGGCAA 
1151 GTGGGCGCAG CACCAGGGAA ATGTATCTCC ACAGGTTCTG GGGCCTAGTT 
1201 ACTGTCTGTA AATCCAATAC TTGCCTGAAA GCTGTGAAGA AGAAAAAAAC 
1251 CCCTGGCCTT TGGGCCAGGA GGAATCTGTT ACTCGAATCC ACCCAGGAAC 
1301 TCCCTGGCAG TGGATTGTGG GAGGCTCTTG CTTACACTAA TCAGCGTGAC 
1351 CTGGACCTGC TGGGCAGGAT CCCAGGGTCA ACCTGCCTGT GAACTCTGAA 
1401 GTCACTAGTC CAGCTGGGTC CAGGAGGACT TCAAGTGTGT GGACGAAAGA 
1451 AAGACTGATG GCTCAAAGGG TGTGAAAAAG TCAGTCATGC TCCCCCTTTC 
1501 TACTCCAGAT CCTGTCCTTC CTGGAGCAAG GTTGAGGGAG TAGGI I I IGA 
1551 AGAGTCCCTT AATATGTGGT GGAACAGGCC AGGAGTTAGA GAAAGGGCTG 
1601 GCTTCTGTTT ACCTGCTCAC TGGCTCTAGC CAGCCCAGGG ACCACATCAA 
1651 TGTCAGAGGA AGCCTCCACC TCAIGI 1 1 IC AAACTTAATA CTCGAGACTG 
1701 GCTGAGAACT TACGGACAAC ATCCTTTCTG TCTGAAACAA ACAGTCACAA 
1751 GCACAGGAAG AGGCTGGGGG ACTAGAAAGA GGCCCTGCCC TCTAGAAAGC 
1801 TCAGATCTTG GCTTCTGTTA CTCATACTCG GGTCGGCTCC TTAGTCAGAT 
1851 GCCTAAAACA I I I IGCCTAA AGCTCGATGG GTTCTGGAGG ACAGTGTGGC 
1901 TTGTCACAGG CCTAGAGTCT GAGGGAGGGG AGTGGGAGTC TCAGCAATCT 
1951 CTTGGTCTTG GCTTCATGGC AACCACTGCT CACCCTTCAA CATGCCTGGT 
2001 TTAGGCAGCA GCTTGGGCTG GGAAGAGGTG GTGGCAGAGT CTCAAAGCTG 
2051 AGATGCTGAG AGAGATAGCT CCCTGAGCTG GGCCATCTGA CTTCTACCTC 
2101 CCAIGI I IGC TCTCCCAACT CATTAGCTCC TGGGCAGGAT CCTCCTGAGC 
2151 CACATGTGCA GGTACTGGAA AACCTCCATC TTGGCTCCCA GAGCTCTAGG 
2201 AACTCTTCAT CACAACTAGA TTTGCCTCTT CTAAGTGTCT ATGAGCTTGC 
2251 ACCATATTTA ATAAATTGGG AATGGGTTTG GGGTATTAAA AAAAAAAAAA 
2301 AAAAAAAAAA AAAAAAAAAA (SEQ ID NO:l) 

FEATURES: 

5*UTR: 1-228 

Start Codon: 229 

Stop codon: 994 

3'UTR: 997 
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Homologous proteins: 
TOP 10 BLAST Hits 



CRA 
CRA 
CRA 
CRA 
CRA 
CRA 
CRA 
CRA 
CRA 
CRA 



1000682328847 /altid=gi 
18000005015874 /altid=gi 
88000001156379 /a"ltid=gi 
88000001156378 /altid=gi 
18000005154371 /altid=gi 
18000005126937 /altid=gi 
18000005127186 /altid=gi 
18000005127185 /altid=gi 
18000005004416 /altid=gi 
18000005004415 /a!tid=gi 



8051618 /def=ref|NP_057952.1| UMd.. 
5031869 /def=ref|NP_005560.1| UM .. 
7434382 /def=pir| |X5814 UM motif. . 
7434381 /def=pir| 1X5813 LIM motif. . 
7428032 /def=pir||JE0240 UM kinas.. 
6754550 /def=ref|NP_034848. 1| UM .. 
2804562 /def=dbi | BAA24491. 1| (AB00.. 
2804553 /def=dbj I BAA24489.il (AB00. . 
2143830 /def=pir| 1 178847 UM motif. . 
1708825 /def=sp|P53670|UK2_RAT LI.. 



BLAST dbEST hits: 



91 
91 
91 
91 
91 
91 



10950740 /dataset=dbest /taxon=96. . 
10156485 /dataset=dbest /taxon=96. . 
5421647 /dataset=dbest /taxon=9606 
10895718 /dataset=dbest /taxon=96. . 
13043102 /dataset=dbest /taxon=960. 
519615 /dataset=dbest /taxon=9606 / 
11002869 /dataset=dbest /taxon=96. . 



EXPRESSION INFORMATION FOR MODULATORY USE: 

library source: 

From BLAST dbEST hits: 



91 
91 
91 
91 
9i 

gi 
gi 



10950740 

10156485 

5421647 

10895718 

13043102 

519615 

11002869 



teratocarcinoma 

ovary 

testis 

nervous_normal 
bladder 
infant brain 
thyroid gland 



Score 


t 


AQZ 




485 


e-136 


AM 
HO? 


c _LjJL 


469 


e-131 


ACQ 


e-±o± 


acq 


fl 151 


acq 


r\ 1 31 


ACQ 


fl 131 


ACQ 


A 1 31 


ACQ 


rs 1 31 


Score 


E 


1049 


0.0 


975 


0;0 


952 


0.0 


757 


0.0 


714 


0.0 


531 


e-149 


511 


e-143 



From tissue screening panels: 
Fetal whole brain 
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1 MVQDGQRNLA RLLLPVKVMR SLDHPNVLXF IGVLYKDKKL NLLTEYIEGG 
51 TLKDFLRSMD PFFWQQKVRF AKGIASGMX TWVADFGLS RUVEERKRA 
101 PMEKATTKKR TLRKNDRKKR YTWOJPVIaM APEMLNGKSY DETVDIFSFG 
151 IVLCEIIGQV YADPDCLPRT LDFGLNVKLF WEKFVPTDCP PAFFPLAAIC 
201 CRLEPESRPA FSKLEDSFEA LSLYLGELGI PLPAELEELD HTVSMQYGLT 
251 RDSPP (SEQ ID NO: 2) 



FEATURES: 

Functional domains and key regions: 

[1] PDOC00004 PS00004 CAMP_PHOSPHQ_SrTE 

camp- and cQviP-dependent protein kinase phosphorylation site 

Number of matches: 2 

1 108-111 KKRT 

2 119-122 KRYT 

[2] PDOC00005 PS00005 PKCLPHOSPHOLSTTE 
Protein kinase C phosphorylation site 

Number of matches: 4 

1 51-53 TLX 

2 106-108 TTK 

3 107-109 TKK 

4 111-113 TLR 

■ i' 

[3] PDOG00006 PS00006 CK2_PHOSPHO_J5ITE 
Casein kinase n phosphorylation site 

Number of matches: 4 

1 51-54 TLKD 

2 76-79 SCMD 

3 139-142 SYDE 

4 212-215 SKLE 

[4] PDOC00008 PS00008 MYRISTYL 

N-myristoylation site 

Number of matches: 4 

1 73-78 GTASQWI 

2 77-82 OCK1V 

3 150-155 GIVLCE 

4 158-163 GQVYAD 



Membrane spanning structure and domains: 
Helix Begin End Score certainty 

1 142 162 0.872 Putative 

2 184 204 0.652 Putative 
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BLAST Alignment to Top Hit: 

>CRA| 1000682328847 /altid=gi 18051618 /defWef |NP_057952.1| L3M 
domain kinase 2 isoform 2b [Homo sapiens] /org=Homo 
sapiens /taxon=9606 /dataset=nraa /length=617 
Length = 617 

Score = 485 bits (1235) , Expect = e-136 

Identities = 241/265 (9050, Positives = 241/265 (90B0, Gaps = 22/265 (830 

Query: 13 LLPVKVMJ^UDHPNVIXFI^ 72 

L VKVMRSLDHPNVlKraGVLYKDK^ 
Sbjct: 353 LTEVKVMRSU)HPhMJ<FIGvLYI<D]^^ 412 

Query: 73 GIASGM DKTWVADreLSf^IVEERKRAPMEKATTKKR 110 

GIASGM DKTVWADreLSRLIVEERKRAP^CKATTKKR 
Sbjct: 413 GDASGWLHSNOIHRDIJJSHN^ 472 

Query: 111 TLRKNDRKKRYTWGNFWIAPEML^ 170 

TIJW«*KKRYTWGNPYV\^ 
Sbjct: 473 TLRKhOUCKRYTWGNPYWMAPEMlJJGKS^ 532 

Query: 171 IJ)reUWKLFWEKFVFTDa 3 PAFFPLA^COlLEPES(^AF^KLEDSFEAl^LYLGEL^ 230 

LDFGUWKLFWEKFVPTDCPPAFFPLAAICCRLEPESIVAFS 
Sbjct: 533 U)RjUnIVKLFWEKFVPTDCPPAFFPLAAICCRLEPES^ 592 

Query: 231 PLPAELEEU>ITV5MQYGLTRDSPP 255 

PLPAELEELDHTVSMQYGLTRDSPP 
Sbjct: 593 PLPAELEEUDHTVSMQYGLTRDSPP 617 (SEQ ID NO: 4) 



Hmmer search results (Pfam): 



Model 


Description 


Score 


E-value 


N 


PF00069 


Eukaryotic protein kinase domain 


100.1 


l.le-26 


2 


CE00031 


CE00031 VEGFR 


4.9 


0.14 


1 


CE00204 


CE00204 FIBR0BL^STjGROWTH_RECEPT0R 


4.7 


1 


1 


CE00359 


E00359 bone_morphogenetic_protein_receptor 


1.8 


7.9 


1 


CE00022 


CE00022 MAGUK^subf ami 1 y_d 


1.5 


2.5 


1 


CE00287 


CE00287 PTKJEph_orphan_receptor 


-48.4 


3.8e-05 


1 


CE00292 


CE00292 PTK_membrane_j5pan 


-61.8 


2.1e-05 


1 


CE00291 


CE00291 PTK_fgf_receptor 


-113.0 


0.027 


1 


CE00286 


E00286 PTK_EGF_receptor 


-125.1 


0.0021 


1 


CE00290 


CE00290 PTK_Trk_family 


-151.3 


6.5e-05 


1 


CE00288 


CE00288 PTK_Jnsulin_receptor 


-210.4 


0.014 


1 



Parsed for domains: 



Model 


Domain 


seq-f 


seq-t 




hmm-f hmm-t 




score 


E-value 


PF00069 


V2 


16 


79 




41 


105 




52.1 


2.3e-13 


CE00022 


VI 


124 


153 




187 


216 




1.5 


2.5 


PF00069 


2/2 


81 


156 




129 


182 




48.0 


3.1e-12 


CE00031 


VI 


129 


156 




1114 


1141 




4.9 


0.14 


CE00204 


VI 


129 


156 




705 


732 




4.7 


1 


CE00359 


VI 


79 


157 




287 


356 




1.8 


7.9 


CE00290 


VI 


9 


218 




1 


282 




-151.3 


6.5e-05 


CE00287 


VI 


1 


218 


[. 


1 


260 




-48.4 


3.8e-05 


CE00291 


VI 


1 


218 


[• 


1 


285 




-113.0 


0.027 


CE00292 


VI 


1 


218 


[• 


1 


288 




-61.8 


2.1e-05 


CE00288 


VI 


1 


218 


[• 


1 


269 




-210.4 


0.014 


CE00286 


VI 


6 


218 


• • 


1 


263 




-125.1 


0.0021 
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1 TCATCCTTCC GCAGGGGCCA TGCTAACCTT CTGTGTCTCA GTCCAATTTT 
51 AATGTATGTG OGCTGAAGC GAGAGTACCA GAGGTTTTTT TGATGGCAGT 
101 GACTTGAACT TATTTAAAAG ATAAGGAGGA GCCAGTGAQG GAGAGGGGTC 
151 CTGTAAAGAT AACTAAAAGT GCACTTCTTC TAAGAAGTAA GATGGAATGG 
201 GATCCAGAAC AGGGGTGTCA TACGGAGTAG CCCAGCCTTT GTTCCGTGGA 
251 CACTGGGGAG TCTAACCCAG AGCTGAGATA GCTTGCAGTG TGGATGAGCC 
301 AGCTGAGTAC AGCAGATAGG GAAAAGAAGC CAAAAATCTC AAGTAGGGCT 
351 GGGGTGAAGG ACAGGGAAGG GCTAGAGAGA CATTTGGAAA GTGAAACCAG 
401 GTGGATATGA GAGGAGAGAG TAGAGGGTCT TGATTTCGGG TCTTTCATGC 
451 TTAACCCAAA GCAGGTACTA AAGTATGTGT TGATTGAATG TCTTTGGGTT 
501 TCTCAAGACT GGAGAAAGCA GGGCAAGCTC TGGAGGGTAT GGCAATAACA 
551 AGTTATCTTG AATATCCTCA TGGTGGAAAG TCCTCATCCT GTTTGAATTT 
601 TQGAAATAGA AATCATTCAG AGCCAAGAGA TTGAATTGTT GAGTAAGTGG 
651 GTGGTCAGGT TACAGACTTA ATTTTGGGTT AAAAAGTAAA AACAAGAAAC 
701 AAGGTGTGGC TCTAAAATAA TGAGATGTGC TGGGGGTGGG GCATGGCAGC 
751 TCATAAACTG ACCCTCAAAG CTCTTACATG TAAGAGTTCC AAAAATATTT 
801 CCAAAACTTG GAAGATTCAT TTGGATGTTT GTCTTCATTA AAATCTCTCA 
851 CTAATTCATT GTCTTGTCCA CTGTCCGTAA CCCAACCTCG GATTGGTTTG 
901 AGTGAGTCTC TCAGACTTTC TGCCTTGGAG TTTGTGAGAG AGATGGCATA 
951 CTCTGTGACC ACTGTCACCC TAAAACCAAA AAGGCCCCTC TTGACAAGGA 
1001 GTCTGAGGAT TTTAGACCCA GGAAGAATGA GTGATGGGCA TATATATATC 
1051 CTATTACTGA GGCATGAGAA GAGTGGAATG GGTCGGTTCA GGIGGIGI II 
1101 TAAGGCCTCT TCCCAGCTTG TTTAACTCTT GTCTGGGGAA CGAGGGGGAC 
1151 AACTGTGTAC ATTGGCTGCT CCAGAATGAT GTTGAGCAAT CTTGAAGTGC 
1201 CAGGAGCTGT GCTTTGTCTA TTCATGGCCC CTGTGCCTGT GAAACAGGGT 
1251 TCGGTGACTG TCACTGTGCC TGTCGCAGTC TGTAGTTACC CAGAGAGAAC 
1301 AAAGCTGCAT ACACAGAGCG CACAAGGGAG TCTTGTAACA ACCTTGTCCT 
1351 GCTTTCTAGG GCTGAGTCAG GTACCACAGC TTGATCTCAG CTGTCCTCTT 
1401 TATTTCAAGA AGTTGACATC TGAGCCATAC CAGGAGTATT GTAI I I IGI I 
1451 TGAGGCCTCT CTTTTTGGAG GAACATGGAC CGACTCTGTG CTTTTGTCTA 
1501 TGCTGGTCTC TGAGCTCACA CAACCCTTCA CCCTCCTTTC TCAGCCAGTG 
1551 ATAGGTAAGT CTTCCCTATC TTGCAAGGCT CAGCTCAAGT GTCAGCTTCC 
1601 TCTACAAAGA CTTTCCTGGT TCCCCTCATT GGAGTGAACA AGAGTTGACA 
1651 TGGTAGAATG GAAAGAGCAG AAGCTTTAGA ATGAGCCAGA CCTGAGTATG 
1701 AATGCTAGAT CCACCACTTA GCTAGTCAAC CCTGCCCCCT GCCTCAAGTT 
1751 TTAATTTTCC TATCCATTAA GTGAATATAA TAATACCTGT GTCACAGGAT 
1801 TATTTTGAGA ATTAAATGAG ATTAGGTCTA TGAAAGCACC TAGCAGAGTT 
1851 CTTGGCATAT AGGAGGCATT CATTAAATAT TTGTTCTTCC CCTTTTATAC 
1901 CCATTACTTT TCI I I I ICTG AACTAAAATA ATACTTGGTT CTATCTCTGA 
1951 AATAACATCC AAGTGAAAAA TCAACAACAT GAAAGAGCAG TTCTTTTCCA 
2001 GTGGATTTGC TTCTTAAGGA GCAGAGATTA TGTAATCTAA CAGCCTCCAA 
2051 CATACAAAGA GCTTTGTATC TAGAACAGGG GTCCCCAGCC CCTGGACCGC 
2101 CAACTGGTAC GGGTCTGTAG CCTGTTAGGA ACCAGGCTGC ACAGCAGGAG 
2151 GTGAGCGGCG GGCCAGTGAG CATTGCTGCC TGAGCTCTGC CTCCTGTCAG 
2201 ATCAGTGGTG GCATTAGATT CTCATAGGAG TGTGAACCCT ATTGTGAACT 
2251 GCACATGCAA GGGATCTGGG TTGCATGCTC CTTATGAGAA TCTCACTAAT 
2301 GGCTGATGAT CTGAGTTGGA ACAGTTTGAT ACCAAAACCA TCCCCCCGCC 
2351 CCCCAACCCC CAGCCTAGGG TCCGTGGAAA AATTGGCCCC TGGTGCCAAA 
2401 AAGGTTGAGG ACTGCTGATC TAGAGGACCA ATTTATTCAA TGTTGGTTGA 
2451 GTAAATGAGC TCTTGGATTA GGTGATGGAA AAATCTGAAA AAACAGGGCT 
2501 TTTGAGGAAT AGGAAAAGGC AGTAACATGT TTAACCCAGA GAGAAGTTTC 
2551 TGGCTGTTGG CTGGGAATAG TCATAGGAAG GGCTGACACT GAAAAGAAGG 
2601 AGATTGTGTT CGTTTCTTCT TCTCAGAGCT ATAAGCAAAG GCTGAAAGTT 
2651 CTAGAAAAAG GCAAGI I I IG TTTCAGTAGA AAAAAGGATA ATCAGAACCA 
2701 1 1 1 1 IAGAAA ATGGAATGAG ACTACTTTTG AGGCCATGAG TTCCTTGTCC 
2751 CTGGAGAGAT GAGCAGAGGT TGGACAAGTG CTTACCAGAG ATCTTGTGGA 
2801 GGCAGAAACT GTGCATCTAG CAGAGCATTG GCCTAACCCT TTCAAATGAG 
2851 ATGCTGTTAA CTCAGTCTTA TTCTACATGG TAGGAATCCT GTCCCTTTGC 
2901 CTCCTGCTAC TTTGGGCCTC TCAACCTCTT GGTTTTGTGT GCAGGTGAAG 
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2951 ATCTCTGGAG GTGTCCAGGC TCTGGQGACC ACATTGCTCC AAGCCAGATA 
3001 TGGTACAGGA CTGTCAACGA AACCTGGCAC GGCTCTTGCr TCGGGTAGGT 
3051 QQGCCTATCC TCCCATCTTT ACCAGTGTAC TATGGGCCAA GCACTATTTC 
3101 ATGTTCTGAT GGAAAACACA GAAACAAGCT TCTCAGTTGA GAATTTCAAT 
3151 CTTAQGGTCG GGAAAGGAAT GTACCAAGGA AGAGCTCATG ACCAAACCTC 
3201 AAGTGTGGGC CCCCTGAACC CAGGTTAAAT TGGAAGAGCC ATAAATQQGC 
3251 CAGCTGGAQG CAGGGTGGGG GGATGAGAGG AGCCCTTTCC AGGGTTGTCC 
3301 CATATCCCTC ACTTTA7GGG TCAGGAAACT GAGGCCCAGG AAGAGTGACT 

3351 TTCCTCjTGGC TGCACTACAG attatgcagg tacttcaaga gttgtttgta 

3401 TTCTTATTTT ATTTTATTTT ATTTTATTTT ATTTTATTTT ATTTTATCAG 
3451 AGGGATTCTT GCTGTTGCCC AGGCTGGAGT GCAGTGGTGC AATCTCGGCT 
3501 CACTGCAATC TCTGCCTGCT GGGTTCAAGT GATTTTTCTG CCTTAGCTTC 
3551 CTGAGTAGCT GAGATGACAG GCACCTGCCA CCATGGGCAG CTAATTTTTG 
3601 TATTTTAGTC GAGACGGGGG TTTCAACATG TTGGTCAGGC TGGTCTTCAA 
3651 CTCCTCACCT CAAATGATGC ACCCACCTX3G ACCTCCCAAA GTGCTQGAAT 
3701 TACAGGCGTG AACCACTGTG CCCAGCCAAG AGTTGTTTTT AGTGTGGT7G 
3751 GCAGAGCCAG CTCTTCCTTC ACCACAGGAT GCCTCCCTAG GTTCCTACTT 
3801 I I IGI I ACTA GCTTTTATTA TAGCTATATT ATTATTATTA TTATTATTAT 
3851 TATTATTATT ATTATTGAGA CAGAGTCTCG CTCTGTCGCC CAGGCTGGTG 
3901 TACAGTGGTG CGATCCCGGG CTCACTGGAA CCTCTGCCTC CCGAGTTCAA 
3951 GCAGTTCTCC TGCCTCAGCC CCCCGAGTAG GTGGGAGTAC AGGCGCCTGC 
4001 CACCACACCC GGCTAATTTT TGTAI I 1 1 IA GTAGAGACGG GGTTTCACCT 
4051 TGTTGACCAG GCTGGTCTGG AGGTCCTGAC CTCAGGTAAG TGCTAGAATC 
4101 ACAGGCGTGA ACCACTGCGC CCAGCCAAGA Gi IGI I I I I A GTGTGGTTGG 
4151 CAGAGCCAGC TCTTCCTCAC CACAGGTTGC CTCCCTAGGT TCCTACTT7T 
4201 TCTTACTAGC TTTATTATAG CTACATTATT ATTATTATTG TTATTATTAT 
4251 TGAGACAGAG TCTCGCTCTG TCGCCCAGGC TGGTGTACAG TGATGTGATC 
4301 TTGGCTCACT GCAACCTCTG CCCCCCGAGT TCAAGCAATT CTCCTGCTTC 
4351 AGCCCCCCTA GTAGGTGGGA CTCCAGGCAC CTGCCACCAC GCCCAGCTAA 
4401 1 1 1 1 IGTATT TTTAGTAGAG GCGGGGTTTC ACCTTGTTGG CCAGGCTGGT 
4451 CTCAAACTCC TGACCTCAGG TGATCCGCCT GCCTGGGCCT CCCAAAATGT 
4501 TGGGATTACA GGCATGAGCC ACCGCGCGCT GCCTATAGCT ACATTATTTT 
4551 TGTAGGCAGC TCAGTTTCTT AAAAATTATA CAGACTTCAA ATCAGATTTG 
4601 TTCCTGCTGT CTGAGGCTCA GTTTCTTCAT CTGGAAAATG GATGGTAATA 
4651 ATCTTGTTGA GATTGAATGA AATAATATAT GCAGTGTATC CAGTACATGG 
4701 TAGACACCCA GTGAATGGTT ATTCCTTCCT CCCATCGGAT TGGAATTCTC 
4751 AAGGGTGGGA ACTTGTCTTT ATATTCTTCA CAACGTAAAA TAGTTGAAAT 
4801 I IGI PGGTGG AAAGAAGAGC AGTCCACTCC AGAGGCTGGA TGGGCATGCC 
4851 TGGCCCCCAA GGTCTGAAGT GGTAGGGCTG TGCCTATATC CTGAGAATGA 
4901 GATAGACTAG GCAGGCACCT TGTGCTGTAG ATTCCAGCTC CTGCACATAG 
4951 CTCTTGTTGT AAAACATCCC TGTGCTTATA CCAAGTAATT GAGTTGACCT 
5001 TTAAACACTT GCCTCTTCCC TGGGAACCAT ATAGGGGATT GGCCTGGAGA 
5051 CGTCTGGCCT CTGGAAGAGT TGGAAAGCAG CCATCATTAT TATCCTTTCC 
5101 TTTCAGCTAT AACTCAGAGC TCTCAAGTCT I I IUGIGGA TCTTATTGCC 
5151 TTGGTTCTTG CCCCTTTTAC TCCCAGGGAA GTTGATTCTG TCTTTTCTGT 
5201 TCCATTTAGT ATGACAGGAG CAGAGAATGT CAGAGCTGTA AGGGACCTTA 
5251 TAGTTAAAGC CTTTGGCTGG TCCTTTCATT TTATAGCTCG GACTAATAAG 
5301 TAACGTCAAA ACCCAATGAG TTCACAGATT GGGTCTCGCC TTGGCATGTA 
5351 ACCCATATGT TCATATTCTT GCTGTTTTCC TATGTGTATG AATATTTTCT 
5401 ATCCAAAATA AGCAGGACAG GGTAGAGCAA GTTAATCTTT GGAATTTCTG 
5451 GATTCTCTTA GAGCTAAAAA ACTTCAGAAC TAGAAGAAAC CACCCACTAT 
5501 ATGGTATAAC CCATTCATAT CACAGATGAG GCCTGAAACC AAAAAGACTT 
5551 GCTCAGGCCA TGGATGACAA GAGCTGGCCC TAGCACTGAA CTCTTGGGTC 
5601 ATTTGTAGGT CTAGTCAGAT GCTAGCTTGT TAGCTCTGTG CGTGCGTGTG 
5651 TGTGTGTGTG TGTGTGTGTG TGTGTGAGAT AGAGACAGAA AGATAACATA 
5701 TGTACACAAA TACATAAAGA GGAAGTAGAC ACGTTAGCAT GGTAGATAAG 
5751 AGTACAGGCA GGCCAGGCGT GGTGGCTCAC GCCTGTAATC CCAGCACTTT 
5801 GGGAGGCCAA GGCAGGTGGA TCACCTGAGG TCAGGAATTC GAGACCAGCC 
5851 TGACCAACAT GGTGAAACCC CATCTCTACT AAATACAGAA AAAAATTAGC 
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5901 TTQGCATGGT GGCACATGCC TGTAATCCCA GCTACTTCQG AAGCTGAAGC 
5951 AGGAGAATOG CTTGAATCOG GGAAGCAGAA GTTGCAGTGA GCCGAGATTG 
6001 TGCCATTACA GTCTAGCCTG GGCAACAAGA GGGAAACTCC ATGGCAAAAA 
6051 AACAACCACC ACCAAGAGTA CAGGCTATCG AATGAGACTA TCGTTTTAAA 
6101 TCCTCGCTTT GCAATTTATT AACTAGCCTT AAGTGACTTC CCTGAGCTTC 
6151 AGGCACCAAT CTGTAAAATG AGGATAAGAA TATTACTCAT GCCACATCGT 
6201 7UTTAGGGAG GATTAAATGT GATAACCTAT ATAAAGTCGC TAGCATAGCA 
6251 TCTGACATAT AGAAAACTCT TAATAGGGCC GGACGTGGTG GCTTATGCCT 
6301 GTAATCCTAG CACTCTQGGA GGCCGAGGCA GAAGGATCGC TTGAGCCCAT 
6351 GAGCCCAGGA GTTTCAGACC AGCCTGGCCA ACATGGCAAA ACTCCACCTC 
6401 TACAAAAAAT ACAAAAATAT TAGCCAGGCG TGATGGCACA CACCTGTAGT 
6451 CCCAGCTACT TGGGAAGCTG AGGAGCGATG ATTACCTGAG CCCAGGGATA 
6501 TCAAGGCTGT AGTGAGCTGT GATCATGCGA CTGTACTCCA TCCAGCTQGG 
6551 GGACAGAGTG AAACCCCTGT CTCAAAACAA AACAAATCAA AAAAAAAACC 
6601 CTTAATAATC AGTAACTGTC ACT7TATATT ATC1 PGTGAG TGTGTGTCTA 
6651 TATACACCTA TATGTATACA TTTCTCTTAT TACACATTCA TTGGTGATCT 
6701 GATGTCGAGC CCCAGGGATT MGGGCAACT TTGAACTACC CTGACACAAT 
6751 CAAGCCAAAT ATCATTCCCG TGGAGGAAGT AGAGTATCTA GGTTCTGTCT 
6801 CCTAGTTGCA GCTTTACCTT GAGGACAGAG ACTCTAATCC AGCTGTGCTG 
6851 AAGGAGCACA TCTCCTGACT TCTGAGCTTT CCCGTQGTAA" ATTCAAACTG 
6901 GATGTCACGG CGCCCTCAGA TAGAGCCTGG TAATTTGCCC TGGGGAGAGT 
6951 GACTGTCTTT TGGATCTAAT TTGACTTTTG CCCCAGTTGG AGGAAAATCT 
7001 TCAGGGCTAG GAAGGATTGT ATTTGTCTGA CCGCAGAGAT. AAGCTGGGTT 
7051 TTGAGGAACA TQGGGCATCA AGCTGAATCG TCTTCTAAGA TCTGTCCCAC 
7101 GCCAGCTTCC CAGTGTTTCT CTGATGAATT TAGAGTACCT GAGTAGTCCA 
7151 GGCCTGCTCG GAGGAGGACT CTCGCTCTGT GCTACTCAGA GAAATTCATT 
7201 CTTCAAGGCC CCCTTCCAGC CTTGCTCTTA CCCAGCTGGG CTACAGTTAC 
7251 AATAAAGGAA ATGACTTTTC TTCTCCCCTT CCCCCAGTAC CTTTGTTTTC 
7301 CTAGTCACAG GGTCGGGCTG GATATTGAAT GGAGAAATTG CTGGGGTCCA 
7351 TCCTAAACTC CTCCCCTCAT CTCTCCCTTA CATTACCCCA TTCTTCTCTC 
7401 TCCAGCCACA TCCATAATCC TGCCTCTGTT AGCCTTCCGA CAGACCCTCA 
7451 GGTGCCCAGG AGAACAGGAA GCTACTTAAA GCTCGAACCT CAGACTGTGC 
7501 AATGGAGGCC AGTGACAAAA CTGAAAGTAG CTCTGTCAGT AATTGTGCTG 
7551 GTGCGATTAG GCAGCTGGCC AGAATCTTTT GGATCTCCTG GACATATGGC 
7601 TGACTAGTCC TCCCAAGCCT TCCCAACAGG CCTCI 1 1 1 1 I TTCCTTTTTT 
7651 TCI I I ICI I I I I I I ILI I IC TTTCTTTCTT TCI I 1 1 I 1 1 I 1 1 1 1 I I I IAG 
7701 GCTAGTGAAG TGAAATTGTG GGAGTGGAAA AGGAAGAAAG AAATCGGTAA 
7751 CTGGTAGTGA TCAATTACTT GTAAACACTA TTGTACTTGG ACCAGCCGAG 
7801 TAGGCCTTTT TTAAAACTCT GAGTTACCTC TCTTTCCTTT CCTTGAGCAG 
7851 TGCCATTAAT TCTGTATCTG GGGCAATCCT TTCTGATGTT CTCTGGACCT 
7901 GGCTCTCTCT CCTTAGGAGA GGCCAGGAGA GTAGCCAGAG AGCATGTCAT 
7951 TTGTAGCTGA GGTTAAAGTG TGGAGCTATC AATGGTGACC TGGCCTCTTG 
8001 GCATGTTAGC AAGCCAGAGG ACCTTGACAA CI I 1 1 I I GAT GATTGTCCGT 
8051 TCACCCTGAT CAAAGGTGTT TGGCTTAGGA GGAGGGAAGA AAAGCTACCC 
8101 CTATTAGTCT TGATGGCCCC AGCGTGGGTC TCTATTGCTT GACCTGGTTC 
8151 CTAGCAGCAT TATCAGAAGG AAAATCGACC GCTCTTAAGG CTCCTGGGAA 
8201 CTTTCAGGAC TTCCTTTCTC AGGATTGCAA ACATAAGACT ATTTGAGCTT 
8251 TCACTTTTGA AAAGCGGTTA CTAATACCTA TACTCTGGGA AAGGGCTMT 
8301 GCAGATAGAA GACTGTGGTC ACTGCATCAG GCAACAGACC ATTTCCGCTA 
8351 AATTTAGTGA CTCCAGGAAG GCCAGTGAAG AAATAACACA GGTAGCAACC 
8401 AGAGACTGTG TTGTAATATG TTGGCTGACA GCAGGGTACT TTCTGTGATG 
8451 CTGAAAGCCA CATTCATTTT CTCTCCCCTC ATCCCCATCT AAGCAAGCCT 
8501 GGTAGAATCA TAATTACAGT AATAGGTACC ACTTATTGAG TACTCTGTGC 
8551 CAGACACCCT CCTGAGCATA CGACATGCAT AGCACATTTA ATCCTTAGAA 
8601 TGACTTAATA AAATGTAGTA CTAGTCTTAC CTACTTCGAG AATAGGGAAA 
8651 TGGAGGTTAC TTGTTTAAAG TCACAGAGCT AATAGGTAGC ATAGCTGAGA 
8701 TTTGAACTCA GGCATTCTTA CTCCTTGCCT GCAAGAGTCT CTTGGCATTC 
8751 TTGAATGCAA GCATATTTCT TAACCTCACT GAGGCTGAGT TTCCTCTTAT 
8801 ATAATATGGG GTAAAGAGCC CTGACCCTGC CTGCGACAGA CTGGTAGTGT 
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8851 CAGATAACAT TGAAGGGTGT TAGTTTAAAG GCTTCATGGA CTOATAATC 
8901 TCAACAAAAG TGCTGTTAAC TTTCTTCTGG GTCTCAGGCT CCTGATGTAG 
8951 AGTCAGTGGA GCAACCCTCC CATCTCCTCT TATGCTGTTG ATCTTGCPGC 
9001 CACACTTACT AACCTAAACC TTTGATTCTG GCTGTGGCCT TCTCCAGAAG 
9051 GTGTTTACTC ATTTGTCCAG TTTATCTTTT AGGAAACAGC CAGCCCGTAG 
9101 ATCATTAAGG CTGGCTATTG GACAGGGGGC TGGGGCCTGC CTGACAGAGG 
9151 AAGGAAGGGC AGACATCTGG TTCTTCCTCT GCCCCTACAA GAGACTCCAG 
9201 CCTGACCACA GAGTGGTACT CCTAGGATGT AGCAGCAGCA TATGAGCTTG 
9251 AATGTGCCTT AATCCTGCTC TTTACTTTGA GAAGAGAGAA CTAAGGACCC 
9301 ACAGATGTTT CACAGCTTCT ATAGGAGGCA GAGGTAGAAA AATGGAGAGA 
9351 GATGAGGCCA GAGATAGATA ACTGATATTA ATTAAACGTT GTATTAAGAA 
9401 CCTCACTTAG ATTATCTGAT TCAATCTTCA TAATAACCCT GCAACCCCCA 
9451 CGI I I I 1 1 IG AGAACAGGGT CTTGCTCTGT TGTCCAGGCT ACAGTGCACT 
9501 GGTACAATCA TAGTTCACTG CAGTGTCAAC CTCCTGAGCT CAAGCAATCC 
9551 TCCCACCTCA GCCTTGCAAG CAGCTTGGAC TACAGGCGTG CCACCACACC 
9601 TTGCCATTTT I I I I IATTTT AAGTAGAAAC MGGTCTTAT TAATACTATG 
9651 TTGCCCAGGC TGGTCTTGAA CTCCAGCGAT CCTCCTGCCC CAGCCTCCCA 
9701 AAGTGCTTGG GATTACGGAA GTAAGCCACT GTGCCTGGCC AGTGCAACCC 
9751 CCATTTTATA CTAAAACAGG AAGGCCCAGA AAGGTTTGGA GTAACTTGTC 
9801 CAGGGTCACA CAGATGATAT TTGAACTCAG GTCTCCGTQG. CTGCGAAGAG 
9851 AGTCTGCTTT CCACTAGGAC TCCCAGGAGA AAAAAAAAAA. AAAAAACAGT 
9901 AGACTTGGAG ACAGAAAATC TGATTTGAGT CTTAGTTGAG CTAGGCTAAC 
9951 TGTGTAACTG TGGGCAAGTT CCTTAGCCCC TGTGAGCCTC AG I 1 1 CI I AT 
10001 CTGTAAAATG TCATAAAAGA AATCCATGTC ATGGAGTAGT; T5TGATGATC 
10051 AAGGACTCTG AAAACATTAG AATGGTTTAA TGTGAAGGAT TAGCAGCAGC 
10101 ACATGGGAAC ATTGTGCATC TTATATTAAC TATCCAAATA TATCAAGCGT 
10151 CATTTGCTAT ATATAAAAGT CATGAAATTA GGCACTGTGG GGGATACGGA 
10201 GTTGGCATAC TAGCCTGGCC TCTTAATTAA TTCATTAATT AGCTTATTTA 
10251 1 1 1 1 IGAGAT AGGTCTTGCT CTATTGCCCA GGCTGGAGTG CAGTGGCATG 
10301 ATGATAGCTT ACTATAGCCT CAATCTCCCA GGCTTAAACA ATCCTCCTGA 
10351 GTAGCTQGGA CTACAGGCAC ACACTACCAT GCCCAGCTAA 1 1 1 1 1 1 1 1 IA 
10401 AIMIIIGIA GAGACAGGGT CTTGCTCTGT TGCCCAGGCT GGTCTCAAAC 
10451 TCCTGGGCTC GAGATCCTCC CACCTGGGCC TCACAAAGTG TTGGGATTAC 
10501 AGGTATGAGC CACGGCACCT GGCCTGGTCT CTTAACTGGT TCCCTAAGAC 
10551 AGCTGGAAAT AGAGAATGTC ATGGAGCATT CCTAACCATG GGCTCCAGCC 
10601 TGGCTTTCAT TCTGTTTCTC CCCTGAAACA ACATTCCTTT AGTAATATTC 
10651 CGAATAACAG CTTCATCAGT CTGTCTACCG ACCACTCTTC AGGCTTCATC 
10701 TTATATGACC TCCCAAACTG CACTAAGGGT TGTATTAGAG AAAAGTGGAT 
10751 AAAGTTCGGA GTCAGGCTGC TTGAGCTTAA ATGCCAGCTT CACTTACCAG 
10801 CCACCTGACC ATGAGTCAGC TGCTTAACCA TTCTTTGCCA CAGTTTCCTT 
10851 GTCTATGAAA AGGGAAATGG CTCCCACCTC AAAAAGTTGT TAACATTAAA 
10901 TTCAATCATG TATTCAAAGT CCTGAGCAGA ATGTCTGGCC ATGACTGGGA 
10951 CTTAACAGAT GTTAGCATTT ATTATTAGTA TCTGTCAGTC TTGAAATGTT 
11001 CTCTTCCCTT GGCTTTCATG ACATTCCACA CTCTCCTGGT TTTCTCTTAC 
11051 CTCTCTGGTA ATACCTGTTT GCTTATCCTT CI I IGICCAG CTCTGGGATG 
11101 TTACCATTCC TTCAGGCGTG CTGTTTTCTC CTTAGGCAGT CTTACACACA 
11151 CTCATGACTT CCTTCCATTG TCCTCCACAC ACTGATGACC CTAAAATCAG 
11201 TATCTCCAGC CTAAACCTTT CCACTGAGTT CTAGACCCAT ATGTTGTACT 
11251 ATCAACCTGG CTTGTCCATT TGAATGTCTT CCAGGCACTT CAGACTCTCT 
11301 TCTCTAGACT TTGCTGGACT TTCACTCTTC CCCCTAAAAC TGGCTCCTCT 
11351 TCCACTGAAA CATGTATGTC ATTGAGAGGC ACCACCATCC ACCCAGTGCC 
11401 TAAGCCAGAA ACCTAGGAAT CCTTGATACC TGTTCTCTCT CATCCTGCAT 
11451 ATCCMGCCT ATCAGTTTTA TCTCTAAATT ATATTTTGGT AGGTTTACTT 
11501 CTTTCCTTTT CTCCCACCAC CACCCTGCTC CAAGCTACCA TCATCTCACC 
11551 TGGATGTCTG CAATAGCCTC ATCTCCCACA GCCACTCTGC ACCCCCTAAT 
11601 CTGTTCTCTA TAGAGCAGTT GGAAGGAGTG Al 1 1 I IGI IG TTTGTTTTGT 
11651 I I IGI I I IAG ACAGAGTCTC ACTCTGTTCC CCMGGCTGG AGTGCAGTGG 
11701 CACAATTTCG GCTCACTGCA ACI ICIGCCT CCCGGGTTTA AGCAATTCTC 
11751 CTGCCTCAGC CTCCCAAGTA GCTGGGATTA AGGCACCGGC CCCCATACCC 
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11801 AGCTAATTTT TATA I II I IA GTAGAGATGG GGTTTTGCCA TGTTQGCCAA 
11851 GCTAGTCTCG AACTCCTGAC CTCAAGTGAT CCACCTGCCT CGGCCTCCCA 
11901 AAGTGCPGGG ATTACAGGTG TGAGCCACTG CACCTGGCTG GAAGGAGTGA 
11951 TCTTAAAAAA AAAAAAAACA AAAAAAAACT TGACTGTGTC ACTCTGTCTT 
12001 GTCTCTCCTA CCTTCTATAC TTCCACAACT TCCCAGTGTT CTTGGATAAA 
12051 GACCAAAATC CTTAACTTGG CCAGGCGCGG TQGCTCACAC CTATCATCTC 
12101 AGCACTTTGG GAGGCCGAGG CAGGCAGATC ATGAAGTCAA GAGATTGAGA 
12151 CCATCCTGGC CAACATGGTG AAACCCCATC TCTACTAAAA ATACAAAAAT 
12201 TAGCTGGTGG TGGTGGCGTG TGCCTGTAGT CCCAGCTACT TGGGAGGCTG 
12251 AGGCAGGAGA ATCACTTGAA CCTGGGAGGC AGAGGTTGGA GTGAGCCOVG 
12301 ATCAGQCCAC TGCACTCCAG CCTGGTGACA GAGTAAGACT CCATCTCAAA 
12351 AAAAAAAAAA AAAAAAAAAA TTCCTTAATT TGGCCTACAG TAGAGCCCTC 
12401 CGTAATGTGG CCTCTCTCCA CATCTCCACA ACCTCCTGCT CCCTGOVCTT 
12451 CAGCCTCACC TCTCI ICIGG ACAGGCCCTC CTTCTGACAA GGGCTTTGTT 
12501 CATTCTGCTC CCTCTGCCTA GAATGCCGCC TTACTCTGTT CACTTAACTC 
12551 CTGCTTATGG TTTAGATCTT TACCTGGATG GCTCAGAGAA ATATAGAAGT 
12601 AATTCCrCAC CCTGAAAAAT AGGTTAGGTC CCTGTTTTAT GTTTTCATAG 
12651 ACCTTTCCTT TGAGGCTTTT TTTAAAAAAG TAGTTTTAAT CTCACATTTA 
12701 TTCATGTGAT CATCTCCTTA ATGATATCTT AAGACCTCTA ATAGAACAAT 
12751 TTGGTCATGG ACTGTGGGGT TTTTGCCCCT CATTGTGTCA GCAGTGAGCA 
12801 TATTGTTGGC ATAGGAGGGA TATTTGTTGA ATGAATTGCT AGAGGTGGCC 
12851 AAGAGATATG ATGTAAGTCA GGCTTTTCCC TGCCCTTCCC CTTCCCCTTC 
12901 CCCACATCCT TCCTATAGCA GCCACCGTGG CTGCAGTTAC TGTAAATGGC: 
12951 AAGACGGAAT CAGTTCCGGA CATTCGGTTG TTTTAGAAAA" TTGGGTGCAA r 
13001 GTGTCAGGGT GATAAGTTAA AGCTTTGTCT TTTGGCCTCA GAGGAGCTAT 
13051 CCCATAGTGA GTAGAAGCCA GAGAAGCTGA CCCGAGGAGT GGTTCTTTCC 
13101 AGCAGCAGGT GTTGAGCTGC ACTTCTCTGT AGCTACAATC CAGGCAGGAA 
13151 CAAGCCCTAG GTACCTCCGG AGAGGAGGGC AAGAGAGGAA GAATGAGTTC 
13201 AGCTACTCTA GCCACCAAAC TGATTATGAA TTGCCCTGAA ATCTGAAAAA 
13251 TTTCAATTCC AATGGTAAGT TTGTTTTGTT TCATTTTGTT TTCTTAAATT 
13301 GTATATTTGA AAGATGGCAT TAACTAAAGA TATATATTCA ATATAGAGTG 
13351 GAAAAAATGG AATACTTGCA TAGTATCTTT TACTTATAGG TGATTTATGA 
13401 TGGGGAGTGG GGTGGATAGG TTGGCAGTTC CCCCAAGAAG TTGGAAATGA 
13451 AGTTTGTCCT CTGTGAGTTG AACTAATTAG ATCCACAAGT AATGAAAGCA 
13501 GTATTGTGTT GTAGTTAAGA GGACACTCTA GAACCAGATT GCTTAGTTTC 
13551 AAATCCTGGT TCTGCCTTTT ATTATCTGTG TACTTTGGGC AAGTTACTTG 
13601 CCCTTTGTGT GCTTCATTTT TCTCATCTAG AAAATGGAGA GGCCAGGCGT 
13651 AGTGGCTCAT GCCTATAATC CCAGCACTTT GGGAGGCCGA GGCGGGCAGA 
13701 TCACCTGAGG TGAGAAGTTC AAGACCAGCC TGGCCAACAT GGTGAAACCC 
13751 TGTCTCTACA AAAATACAAA AATTAGCCAG GCATGATGGC GGGTGCCTGT 
13801 AATCCCAGCT ACCCAGGAGC CTGAGGCGGG AGAAACACTT GAACCTGGAA 
13851 GGCAGAGGTT GTAGTGAGCC AGGATTGCAC CACTGCACTC CAGCCTGGGT 
13901 GACAAGAGCT AGACTCAGTC TAAAAAAAAA AAAAAAAAAC AAACTGGAGA 
13951 TACAGGCTGG GTGCAGGGCT TACACTTATA ATATCAGCAC TTTGGGAGGC 
14001 CTAGGCGGGA GGATTGCTTG AACTCAGGAG TTTCAAGATC AGTCTGGGTA 
14051 ACAGAGCAAG ACCTCATCCC CACAAAAAAT CAAAAATTTA GCCAGGCATG 
14101 GTGGCTCATG CCTGTGGTCC CAGCTACTCA GGAGGCTGAG GCGAGAGGAT 
14151 TGCTTGAGCC CAGGAGGTTG AGGCTGCAGT GAACCATGAC TGCACCACTA 
14201 CATGCCAGCC TGGATGACAG AGCAAGACCC TATCTCAAAA AAAAAAAAAA 
14251 AAAGAAACGA GCCAGGCGCG TTTGCTCACG CCAGTAATCC CAGCACTTTG 
14301 GGAGGCCAAG GCAGGTGGAT CACTTGAGGT CAGGAGATCG AGACTAGCCT 
14351 GGCCAACATG GTGAAACCCC ATCTCAACTG AAAATACAAA AATTAGCCAG 
14401 GCATGGTGGC ATGCTCCTGT AGTCCCAGCT ACTCACTTGG AGGCTGAGGC 
14451 ACGAGAATCG CTTGAACCCA GGAGGCGGAG GTTGCAGTGG GCCAACATCA 
14501 TGTCACTGCA CTCCAGCCTG GGAGACAGAG CGAGACTCTG TCTCAATAAA 
14551 TAAATAAACA TAAAATAAAA TAAAATAAAA TAAAATAAAA TAAAAAAATA 
14601 TGGAGGCCAG CAGGCACGGT GGCTCACGCA TGTAATCCCA GCACTTTGGG 
14651 AGGCCGAGGG GGGCGGATCA CAAGGTCAGG AGATCGAGAC CATCCTGGCT 
14701 AACACAGTGA AACCGCGTCT CTACTAAAAA TACACAAAAT TAGCCAGGCA 
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14751 TGGTGGCAGG CACCTGTAGT CCCTGCTACT CAGGAGGCTG AGGCAGGAGA 
14801 ATGGGGTGAA CCCGGGAGGC GGAGCTTGCA GTGAGCTGAG ATCGCGCCAC 
14851 TGCAGTCCAG CCTGGGCGAC AGAGCAAGAC TCTGTCTCAA AAAAAAAAAA 
14901 AAAAATGGAG GTTGGGCGCG GTGGCTCGCG CCTGTAATCC CAGCACTTTC 
14951 GGAGGTCGAG GCGGGCGGAT CACCTGAGGT CAGGAGTTCC AGACCAGCCT 
15001 GGCCAACATG GTGAAACCTT GTCTCTACTA AAATTACAAA AATTAGCCAG 
15051 GCACGATGGC AGGCACCTGT AATCCCAGCT ACTTAGGAGA CTAAGGCAGG 
15101 AGAATAGCTT GAACCTGGGA GATGGAGGTT GCAGTGTGCT GAGATCGCGC 
15151 CACTGCCCTC CAGTAGAGTG AGATTCCGTC TCAAAAAAAA AAAAAAAGAA 
15201 GAAATGGAGA TACAAACTTA CTACCTACCT CCTTACAACC TACCCTCACA 
15251 GTATTACTGT GAATAAAAGT GTGTGTAGCA CTGGGAACAC TATTCACAGA 
15301 GCACTCATGA ATGTTTGTTC 1 1 IGI IATTA GTTACTAGAG AGGCAAATGT 
15351 CTGCCAGGGC TGAATAATAT GTGTGAATTG GTGATTGTCG CACATATCTA 
15401 AAGAAGTAGT TAI I I I I I IC AATTAAAACT TAGTTTAAAA ACCAATATAA 
15451 GGCCGAGCGC AGTGGCTCAC ACCTGTAATC CCAGCACTTT GGGAGGCCGA 
15501 GGTGGGCAGA TCATTTGAGG TCAGGAGTTC GAGACTAGCC TGGCCAACAT 
15551 GGTGAAACCC TGTCTCTGCT AAAAAAAAAA AAAAAGTACA AAAATTAGCC 
15601 AGGCATGATG GCAGGTCCCT GTAATCCCAG CTACTTCGGA GGCCGAGGCA 
15651 GGAGAATTCC TTGAACCCAG GAGGTGGAGG TTGTAGTGAG CCGAGTTTGT 
15701 GCCACTGCAC TTCAGCCTGG GTGACAGAGG GAGACACTGT CTCAAAAAAA 
15751 AAAAAAAAAA ACCAAAAGCA ATATAATAAA TAAGTQGCCA ■ GCAATGAAAC 
15801 AGAAAGTGAA AAGTTAGTGA AGCAAAACTA GTACTGTATT CAGATAAAGA 
15851 TGCTGAATCT AGATTTGGTC AGCAGAATAG GGTCCTTTCT GGCAACGTGG 
15901 GCTAGTTTGG CTCACTCACC ACTGCCAGGA. TGAAATTTCT iTTCAGTGGCT 
15951 ACTCATTTCG CTTTAT7TTA AGTCCATGCT CAGAGAGCAA CCTTCTGATG 
16001 CCTAATTCAG CTTCCTGGGA TACTTAATAA CAGGAAGGGT CTGGAAGTAG 
16051 TACCTGTATA GGGGATATGA GTGTTCTGAT TTTAATAGTC AATTCATAAG 
16101 TGTACAGAGG GTTTCATAAA TGGTTAGGTC AGAACCATCA CAGAATGTCT 
16151 ACACCTCTTT GGACATTAGG AAGGTCAAAA ACCTGAAAGG CCAAAAGCTA 
16201 GGCCTAGATT AGGGTCATTC ACCAAGAAAA CATCAGCCTT GAAGAGTTCT 
16251 CTGGGTGGTC CACCAGTCAA CCTTCCTTTG ATCACACCTC GTTCCTCGTT 
16301 GCTTCTTTAA GCATTGACCT GTAATGGGTA TGGAAI I I I I TGCTCACCTA 
16351 ACTCCTTCCT TTTACAGAGG AAGAAGTTGA AGCCCAGAGA GATTTAATGG 
16401 CTTGCCTAAG ATCACACGCA GAI I I ICIGI TAACCAGGGT GAI I I I I GAG 
16451 GTGTTCCCTG CCAGACGAGG GU I I I I ICC TTCAATTGCC TAGAGATTTC 
16501 TTCAGATATC CGAAGCATTT TTCCCAGTGC AGCCTGGAGA AGGATGTCCC 
16551 TGTCAACACA GCATTTGTTA CTCAATGTTA GACATTCAAT TTTCTAATTA 
16601 GTATCATGGA GCAACAGTGG ATGATTATCT ATAAGGGGTT GCAATTCCAT 
16651 GCTTATGTGC TTACAGCCCA TATAGACAAA TATCAGCTGT TAAAATGACA 
16701 AGGCAGTAGA GATGTGGCCC CAGGACAAAG GCATACTCTG CTGTTAGTGA 
16751 ACACTAGTTC GCCAGCAAAT TTCACATGGG CATATACACG GCCAACTGTA 
16801 GACTTTAGGC ATTTATACCC ATTCAGAGAG CCAAACTGGC AACTAAAGAT 
16851 CAGCATTCTC TTTGGCATTT CAGCTTTQCG TTCTGTTAAA AATCACTCCT 
16901 TGCTTAAATA CCTCTGATAG CTCTTCACTG CCTGTAGGCA ACTCTTTAGC 
16951 CTAGCAGACT TQGTCTTTAG TGCTCTGCCC CTACTCTCTT CCACCATTCT 
17001 GGCCTCCTGT CTAATTGCTG CCCATATGTG CCATGCACTA GAGCTTACAG 
17051 ACCTGCTCAG CGTTATATGA GCATACCATA CTCTTTATGC CTCAGTGCAT 
17101 TTGCACATGT TGTTCCTTCA GGCCAGAATG CCTGTTACTG CCTGGCAATC 
17151 AGCCTATTAG AGTCTGCCAA TACCATCCCA TCTTCTGTGG AGGAGCCCCC 
17201 CGCCAAATCC ACCCATACCT CTCCCCACCA ATCAGAGACT TCTTCTCTCT 
17251 TTGTTATTCT CTTCGTTATT CTCTTCATAC CTCAGTTATA TCCATTTCAG 
17301 TAI I IGI I IA CACATCTAGC ATCACTCTTA GAGTGTGAAA TTCTCCAAGT 
17351 GTGGAGCGGT ATCTAGTTTG TCTTTGTATC CCAGAGCTTA GCAAAGTGCC 
17401 TAGAATGTAG TGGGTGCTCA GAGTGTTTGC TGGGTGAATG ATGTATTTGT 
17451 TGAACGACTC TTTGGACACT TGAATAAAGT CCATCCAGTA TGCACCATTA 
17501 CCATCTCTTC GCTCTACAAT ATTCTTTTAG GCAAGAGCTT ATCTTTTGAG 
17551 GTGATAAGAT AAGCTCAAAC TTATGTAGAC TAAGACCTCA GTCTGTAAAT 
17601 GTCATCCCTA AGTCTTAAAC CATCAAAACC AGGGCCTCAA GGAATGGCAT 
17651 GCCTTCTGCA ACTGTAGCAA CCTGCTGTGC TTATTTTGCC GTGTTTTTCA 
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17701 I I 1 1 ICCCCC AAAAGCTAGA GTCCCTTCTC CCATGGGCAG TGCTGGAAGT 
17751 GTGCTAACAA ATTCTTTCTC CATACTGCTT ACGATTACAA AAAAAACCCT 
17801 CAGCATCTCA TGCCAGACTT GAGTTAAGGT TGTTTTCTTT 7GTGTGTCAG 
17851 CTGTATTCTC GTCATGACTT CCTGATGATC CCCTATAGAG ATTTTGCTGA 
17901 GATCAGAGGG TGCTCCACTG CCATCAGTAG CACTCACTCT TGCAGAAGCA 
17951 CGGTTTCTGA AGTTGGCTAA TGTCATCCCT CAGGTTTGTT TG7TPGAAAT 
18001 I IGI 1 1 I ACT TCCAGAGATA GCACTTTCAT GGAATGACGC TATCTTCTAG 
18051 AATCACTTTT 1 1 1 1 1 1 1 1 1 1 TCAGTTGGAG TCTCGCTGTG TCGCCAGGCT 
18101 GGAGTGCAGT GGCACAATCT CAGCTCAOG CAATCTCCAC CTTCGGGGTT 
18151 CAAGTGATTC CCCTGCCTCA GCCTCCCGAG GAGCTGTTAC TACAGGCGCA 
18201 CACCCCCACT CCTQGCTAAT TTTATCTCTT TTAGTAGAGA CGGGGTTTCA 
18251 CCGTGTTGGC CAGGATGGTC TCGATCTCCT GACTTTGTGA TCTGCCTCCT 
18301 TCAGCCTCCC AAAGTGCTCG GATTACAGGT GTGAGTCACC GCGCCTGGCC 
18351 TAGAATCACC I I I I IATACC ATAACGTGAG CACCACTGCC GCGTCACCAA 
18401 GGAAAGAGAG AGGCAGCTAC TGTGGGGTTA CAAATCQGTA AGAGTGGCAC 
18451 CAGGAAGGTC AAAGTCTCTA CTTAGCCAAG GCTTAACAAA ATGTCAATCA 
18501 CCAAACATTT ATTTATTAAG CTAGGTTCAG GATAAGAAGA TGAACAAGCT 
18551 ATCTGTACAT TCATTTTCTC GTTTGTAACA AGGTAATGAT AGTGATCTAT 
18601 CCTGCCTGCC TCTGAGGGTT ATPGTGAGAA TAAAATGAAA TCAAGTGGAA 
18651 AAGCACTTAG GAAAAAGAAA AGCATTGGTT TTCAATTGTT AGTGTGGATG 
18701 AGAAACACTG GGGU IGI I I AAAATCCAGA TTCTTAGGGC GAGTCTGAGG 
18751 GATTCTGATT CTGTATATCT GAAGTGGGAC TGAGGAATCT TGATTTTCAA 
18801 CAAGCTGACC AGAGGGTCCA ATCCTCCTAT TCCTTTAGTT., ACACTTTCAG 
18851 AAATATTACT GTAAATCAAA TGGCAAGAAT AAMTAGTfA ; TTTCAGGCAG-: 
18901 TJTTAGTATC TTGGACCTGG AGTCCAAAGA CTTGGGTCAA ACTCCAGCTT 
18951 TGTCAGTTCC TAGACCTGTG ACCTTAAACA GCAAGCTTCT CTGTGAAGCT 
19001 TAGTTCCCTC AGGAACGGCT CTGGTCACCT CCTCCTGTAC TCCATTGATG 
19051 ACTCACCACA TAAGGCTCCC TGGGAGTCCC CCAAACCTTT GCTCTCTTAA 
19101 CTCGnTTAC AGCCTCGTAC ATCTCCTGCA GGTGCTGTCT TCTCCTCCTT 
19151 TTTCCAGGCC CTCCTCTGAC ACAGCATTCA TTCTCCTCTG GGAAGGGTTC 
19201 CTTCAATGTC TCTCCAAGCA CATCACACCC AGGAAGGACC CTGTGGCGAT 
19251 ATCTGTCTAT GACGAGATCA AACTACGTGA AGGCAGGCAC TAGGTACTGT 
19301 CAGTGCCCAG CATAGGCCTC GCCCATACCA GGTGTCCACA GATGCCTAGT 
19351 AAAGAAACCT ATGATTCAGG ACCCCCATGA TGAGCAACTA TAGCACTAGA 
19401 ACAGTGATAA TAACTAATGT TTATAATGCA TCTTCAGTTT ACAGAGGGCT 
19451 I I IGIACTCA TCATCTAGTT TAGTTCCTGC AACAACCTCT TGAGGAATAT 
19501 AGCACAAGCA GGACAAGGGA AGCCCAGAGA TGTTAAATAA TTTATCCAAG 
19551 TTTATGCTGC TGGGAAGGGC AGCACTGAAA TTAAAAGAAA AGTTTTCTGA 
19601 GCTCAAATCC CATGCCCTTT CCTCAATGTG AGCTCTAGCA AGGTATTCAG 
19651 GAATCCTGCC TCTACAGTTC AGAGCCTCAA ATTGCTGGGT ATGTTGAGTT 
15701 CTTGTATCTG Al I I I ICTAG ATTTCCTGCC CACATTCTTA CTGTCTGGAT 
19751 ATCAGGAAAG AGTTTATCAA ATGCCTGTGG AAATCCAAGA TAAGGTCTCA 
19801 TGATGAGTAA CCCAGTGAAA ACATGAAGTC AAGTCTAACT AGTCACTACT 
19851 ATTTCACTAC TGCTGACTCC TGATGATCAG CTCCTTTTCT AAGTGC1 IAC 
19901 TGTCCACTTA TTCCATCATC 7GCCTAGAAT TTATGTGAAG GAATCAAAGC 
19951 AAAAGGATCA TAAGGCTTCC I I I I ICCAGT ATGTTTTTCC TCCTTTTTGA 
20001 AAACTGGGCC AGTTAGCTAT CTCCAI I I I I ATTTCATGAA TACATCCCCA 
20051 GCGCCTGGTA TATAGTAGAT ATGGAACATT ACACTTTCGA GATATTGCAC 
20101 CCATTCTCCA GTTTCTCCAA AGTTACTAAC AATCGTTCCA TCACTGTGCC 
20151 AACATATTTT CI I I I I I CAA TATATTGGGA AATAATTCTC CCAGTCTCAA 
20201 AATCTGAACA CATTTCATGT GACTTGGTAT CCTCATATGT CTTGGGCTTC 
20251 CAATTCTCCA TTCCTAGTTT CAAGTTCATG AACTGTAAAA CAAAGGATTA 
20301 GACTAAATCT CTAAAGTTCT ATCCAGATGC CAAATTCTTT TCTCTTTCCA 
20351 TGATACCTAA GATAGATGCC AAATATTGTC TTTTACCTGG TGI I PGTGAA 
20401 CATGACATCA CATTACAGGA GTAGCAGATA CTAAACTCTC ACTCTGTAAA 
20451 ACACTGACTG AGTTCCATGA GCCAGATACT GAAGTGAGCT TGTTCACATA 
20501 TGTTCTCATT TAATGCTCAT AACCCTGTGA AGCTGGGAAT TGCTGGGACA 
20551 TTTTATTTAT TTATTTATTG AGACGGAGTC TGGCTCTGTC ACCTAGGCTG 
20601 GTGTGCAATG GCATGATCTT GGCTCACCGC AACCTCCGCC TCCCGGGTTC 
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20651 AAGCGATTCT CTTGCCTCAG CCTCCGCAGT AGCTGGGATT ACGGQGCACA 
20701 CACCACCACA TCCAGCTAAT TTTCTATTTT TAGCAGAGAT GGAGTTTCTC 
20751 CATGTTGGCC AGGTTCGTCA GGAACACTTC ACCTCAAGTG ATCTGCCTGC 
20801 CTCAGCCTCC CAAAGTGCTG GGATTACAGG CATGAGCCAC CATGCCTCCC 
20851 CGGGACCCTT GTTTTAGAAG GATGACTQCT GCTATAATGT AGAAAGTCAT 
20901 TTGGAAGAGG GGAGGAGTGG GGCACGAAAG ATGGTTAGTA GATGGGGGTG 
20951 GTAATGCTTA CCTTTCAGTA TTTGGAGGCT TCGGAGTCCT CAAAAATTCT 
21001 CTTCCTTGAT TGGAGTCCTC CCAGCCAATA GAGGGCTTCA CACAAACAGT 
21051 TTCTTGGGTT TTGAATTGTT TGACCAGAGC I MCI ICCGA CAAAAGGTTG 
21101 GGGTCATTCA TTCACTTACC ACACCTTGCC TGAACATTCA CTTCGGGCTG 
21151 CCGGTTATGA AGGCTATPGT TCTCCAGCCT GTCACAGACG CTTTGAAGAC 
21201 CTGTGCCTCA GCTGGTTCTA AGGAGTCAGT T7GTTCAGCT CCGTCCCAGG 
21251 TTTCCAACTT ATGAAATGTG CTGGAGATTA ACACCTCTCC TGCCATTTTA 
21301 TCCCTACTAT AATTCCCAGT CAAAGGATTC CTGCAGTTGC CTCTGGCAGC 
21351 CATMCTGAT GAATGTTCTG CCAGCTCCTC TGAGGACCTA GAAGAGCAGT 
21401 TTTCTATCCA GGACCAGTTT CCAAGGGTCG GAGGGTCAAA TATATCCTCC 
21451 AGTGTGACAT TTCATCTCCC AGTCATGGGT GGCTTGGGCC CTTTGAAGTT 
21501 GGCTCTGAGG AACCACACAC TTGGGTCTGA GCAGCCAGCA GCTTATCACA 
21551 TCTGGTCATC AATCCTTCAA AGGTTCCTCC TGAAGTCTGA Al I I I IGGAG 
21601 GTCAAATGGA TTCCACCTGG GAGGGGCTTC TGCTTCAACT CAGGACATGG 
21651 GGAGAAGGCT GTTCCTCTTC CAGGGGGAGG CAGTTTTCAT GGCATTGAGA 
21701 TGTCCTCTCA CTTATTCCCC ACCCACCCAC CAAGTCCTTT GTAAGAGGAG 
21751 TAGGGGGAGA GGAGAGCGCC TGCAGCCTCC TGCTCACATT CCTAGAGACC 
21801 GACTCACTGA GGCCGTCGCC GCTGGAAGAG GAG4GCTGTG TCAAATGTCA 
21851 AGAGGAGTTA TCCTCATAGG CTCCCTGGCC TCAGTCTCTT. TGTGGCTTGC 
21901 ATATTCTTCC ATTAGTACTG TGTTCATCAC ATGGAAATCA GAGGGTACAA 
21951 TTAAAAGATA ATTTGCTAGT CCCAGACTTA ATTTGGGGCC CCCTTCTTGC 
22001 CTGATTGAAT TACAGGGGAA CATAATAGAT TTTTGGTGAG AAATAGTTGT 
22051 CTGTGTGGCT GGGAGAAAGA TTGCTCCCAG CTCTCCAGCT GGGCAGCCCT 
22101 TTCAGTATCC CGTATGTTAT TTCCCCACTT CCAGCCCACC TCACCTCCTC 
22151 TGTGGCCCTT GTGTGTCCCC TCGGCTAGGA TCCTGACCTC CTGCTCAAGA 
22201 GTTTAAACTC AACTTGAGAC CCAAGGAAAA TAGAGAGCCC TCTGCAACCT 
22251 CATAGGGGTG AAAAATGTTG ATGCTGGGAG CTATTTAGAG ACCTAACCAA 
22301 GGCCCAGACA GAGAGAGTGA CTTGCTAAAG GCCACATAGC TAGCCCACAG 
22351 TAGTTGTAAC AATAGTCTTA ATGATATTAA TGGCTAACAT TTATCAACCT 
22401 TTAATGTGTC CCAGACTTTG TGCCAAGGGC TTACATGCAG TGCATTGTCG 
22451 CATTCAAACC CAGACAGTCT GGCTCTGGGC CCAGGC7GAG CTTTGGTATA 
22501 GCATGGTAGA ACGTTGTCTA TAATGTCTAG TCTGGGTTCA AATCCTGGCT 
22551 TCACTTCTCA CATTTACAGC TGAGTGACCT CAGGCAAGTG ATTTAACCTC 
22601 CCTGTACCTC AGTTCCTTTA TCTGTAAAGA GAAAAATCAC AGCACTGTGG 
22651 AATAGTCGGG GTTAAAATTC ATTCATACAA GTAGTGCTGC AAGCAATGTT 
22701 TAATACAGGG TGAGCACCTG TTCAGTGCTT CCTTCTTCTG GCTGCCTCTC 
22751 GGGCTAGAGT GTGGTGTCTT CGTCGTATAG ATAGATAGAT ATGGCTGAGC 
22801 TCTGCACAAA CACCAAGAGC TGTTCTTCAC TATTAGAGGT AGTAAACAGA 
22851 GTGGTTGAGC TCTGTGGTTC TAGAACAGAG GCGGGCAAGC TATGGCCCAT 
22901 TCCCTATTTT AATACGGCCT GTCATPGATT GAI I I 1 1 I 1 1 TTCTTTTTGA 
22951 GACAGAGTTT CACTCTTGTT GCCCAGGCTG GAATGCAATG GCACGAACTC 
23001 AGCTCACCGC MCCTCTGCC TCCTGGGTTC AAGCGATTCT CCTGTCTCAG 
23051 CCTCTCGAGT AGCTCGGATT ACAGGCATGT GCCACCACGC CTGGCTAATT 
23101 TTTGTATTTT TAGTAGAGAC AGGGTTTCTC CATGTPGGTC AGGCTAGTCT 
23151 CGAACTTCCA ACCTCAGGTG ATCTGCCCGC CTCAGCCTTC CAAAGTGCTG 
23201 GGATTACAGG CGTGAGCCAC CATGACTGGC CTGATTGACT GAI I I I I I IA 
23251 GTAGAGATAG GGTCTTGGTT TGTTACCCAG GCTGGTCTCA AACTTCTGGC 
23301 TTCAAGCAGT CCTCCCTCCT TGGCCTCTCG AATGCTGGGA TTATAGGCAT 
23351 GAGCCACTAT GCCTGGCCTA TATGACCTGT GAI I I I IAAT GGTTAGGGGA 
23401 AAAAAAGCAA AAGAATGCTT TGTGACATGT GGAAATTACA TGAAACTCAA 
23451 ATATCAGTGT CCCAGCCTGG GCAACAAAGT GAGACCCTGT CTCTACAAAA 
23501 AATAAAAAAA AATAAGCCAG GGCCGGGCGC AGTGGCTCAC ACCTATAATC 
23551 TCAGCACTTT GGGAGGCCGA GGCAAGTGGA TCACCTGAGG TCAGGAGTTC 
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23601 AAGACCAGCC TGACCAATAT 
23651 AAATTAGCCG AGCATCGTGG 
23701 GCTGAGACAA GAGAATTGCT 
23751 CAAGATCGOG ACACTACACT 
23801 ACACGCACGC ACGCACACAC 
23851 TGGTGGCCAG OVCGTGTGGT 
23901 GATCACTTCA GCTTAGGTOG 
23951 CTQCAGTTA GCCAGGGCAA 
24001 AAAAAAAAGA AAAAAATCTT 
24051 CATCTCCCTT AGTTTATGTT 
24101 CACAATTGAG TGQCCACGAC 
24151 TTGCTCTCTC GCCCTTTACA 
24201 CATATCTACC AGGTTTGAAA 
24251 CATCTGTAGT CCCAGCTACT 
24301 TCCAGAAGGT CGAGGTCAAG 
24351 CTCCAGCCTG AGTGACAGAG 
24401 AAAAAAAAAA CACCCTCACC 
24451 ACATAACCCC TCAGAACCTA 
24501 GTTTCCTCCT TTTACTCGCA 
24551 CACTTAACAC AGGGCCTAGA 
24601 CTTAACAGTA TTCAAACCCA 
24651 GTGTCCAGTT GGTGGAATGG 
24701 TCTTTATCAG ACTTTCCTGC 
24751 CGGTGACTTC TGGCTCTTrA 
24801 AGAGCTCATC TCACTGCCCC 
24851 TTTCCTCCAG CAGCCTTGCT 
24901 CAAGGGCTTT CTACATGGTA 
24951 GGCTGTTCAG GTGGGCTCCC 
25001 TTCGCACCCC AGGCCTTTTT 
25051 AGCATGGTTA TCACAGGACA 
25101 CAATGGATGG TGTTCTGCAT 
25151 GAGTAACCTG GGCTCCATCC 
25201 CrCCTTAAAG AGCCAGAATG 
25251 GAGTCACATG AGCCTACATT 
25301 TTGACCTTGA GTGAGTTACC 
25351 GTAGAGTGGG AATAATTCCT 
25401 GTTTGCCACA TGGAGACACA 
25451 TTCCTTATTT GCAACACAGC 
25501 TTGGTCAGCT CTTGAGGCTG 
25551 TGGGAGCCCT AGTGCCAGGA 
25601 GCTCTCTCGA CCTGTCTCTC 
25651 OTCCTGGAC CTGGCATCCT 
25701 GAATTACTGT CCTGTAGGCA 
25751 ATATGTTCAC ACTCTATCCT 
25801 GCTCAATTCG TCCCAGTTAT 
25851 CAGCCCAGCT CCACCCCTTG 
25901 CAACCACACA CCTCGGTTCT 
25951 CCTCATTCAG GGGTGGGACT 
26001 TCI I IU I AG CTAGTCACCG 
26051 AGCCTCCACA GAGAGGTGGT 
26101 TTCTGAGAAC TAGGGAGGAG 
26151 TGCTGGGGGC CAAGTGGCCT 
26201 AGGGAGAGGT GAGCTCAGGG 
26251 CCCCGGGCCT GTCATGGTGG 
26301 ACGGATTTGC AGCTGAGCCT 
26351 TTACTTGTCA GTCCCGGCTT 
26401 TTGGTCTCCG AGTTCCCCTC 
26451 GAGGGTGGTC TCCCTAAATC 
26501 CTGCCGGGCA GAAGCCAGCG 



GGTGAAACCC TGTCTGTACT AAAAACACAA 
CATGCGCCTG TAGTCCCAGC TACTTGGGAG 
TGAACCTGGG AGGCGGAGGT TGCAGTGAGC 
GCAGCCTGGG CAACAGAGGG AGACTCCGAC 
ACACACACAC ACACACACAC AGGCTGGGTA 
CCCAGGATGC ACTGGAGGCT TAGGTAGGAG 
TTGAGACTAC AATGAACCAT GTTTATACCA 
CAGTGTGAGA CTGAATCTCA AAAGAAAAAA 
TCCATAAGTA AATATCTGTT GGAACATAGC 
TTATATATGG CTGCTTTTGC CCTATAATGA 
AGTCTGTATG GCCTGCAGAG CCTAAGATAT 
GAAAAAGTGC CTTGACCTGT GCTCTAGAGC 
CTCAGCCTCA CAGCTCGGTG TGATGGCACG 
CTGGAGGCTG AGGTGAGAGG ATCACTTGAG 
ATTGTAGTGA GCCATGATGG CATCACGGCA 
AGAGACCCTG ACTCAAAAAA AAAAAAACAA 
ACTTATCAGC TATTTGTCTT GAGAATAGTC 
TTTCCTAATC TGTTAAATGA GGCTGATGAC 
ATTTAAACAT GATGGATAAT AAATGCTAAG 
AGATATTAAC TGCTCAATAA ATGGTAGCTT 
TGTGCTGTTA TCAGATGCAT TGTTGTCCCT 
GAAAAGGCTC CCTTGTAACC CCATCTACCA 
CATGGTTCAC AGTAAGAGAT AGAAGCTGCA 
CAATGGTGAG GGGTGTGTGG CTGGTAAGGG 
AAATCCAGTA GTGAGATCTG AGTGTTCTGG 
TTTTCCTTTA CAATCCTGCA GGCAGGGAGA 
GGCTCTGG1 1 TGGTCATCGT CACAACTGGG 
ATTCCAGATA CCTAGGCTTA TCAATCCCTT 
CTCCCTCATG CCCCAI I 1 1 1 CAGTTTGAAA 
AGTAGAAGAA GCTCCACTGT CCACTGAGGC 
GTGAACACTC AGTCAATAGT GAGTGAATGA 
TATTTGCAGA GAGCTTTGGA AAAGAI I I I I 
AAGCCTGGTA GTGGGAGAGC TCCAGCTCTA 
TAAATTCCAG CCCTGCCACT GACTCCCTTT 
TAATCTCTCT GTACCTCACT I I ILI IGTCT 
GTCTCAGAGA AATAAAAGAG TGCATATAGT 
TCAGGTGTAG GTTAATACTC TGGGCCTTGT 
CCTGCCCTGG AGTCGAAGTG GCACCTCCCA 
TCCCCAGGAC AGGCAGAGGG AGGGAATGAA 
CAGAACAGAT GGCAGCTCAG AGCTAGGATG 
CTACCAGAGG TCCCCCCGTC TGGTGTGGCT 

CTGLI 1 1 I I I CCA CCTCCAAGCA 

GCTCCTCTGC TTGAGGACAT CTGGGGCCAG 
GCCTTGCCCT TCCCTCAGCT CAGGATGGAC 
TGTCTX3CAGC GCCTGCCTGC AGCCTCGATC 
CCTGCAAGGT CTGTTTCCTA ACAGCTGCTC 
GCGGGAGCCC CTCCTCTTCC TCCCTCCCTC 
GAAGAAGAAG GCTAAOTGA CAGCAGCGCT 
GCCCCTGCTC AAGAATGCCA GTGTGTGTGT 
TTTCTCGGAG TCCAGAGGGG CCGCCTGAGC 
CCATCCCAGC CATGAGCCCC TGTGGGAATC 
GGAGTCCTCA GGCTCCCGCA GCTGCTCCGG 
CAGCCTGCCT GCAGCCAGAG GTGCCGGGAG 
CCATCTACAG CCGGCCTGAG GCAGTCACAG 
GTCTATCTGG TGTGGGAAGA AGATGGGGAG 
ACTTCACCTC CAGAGACCTC TTTCGGTGAG 
TCCATCTCTC CTGGCCCCTG GTCCTGAGAG 
TCCTTCTCAC TTAGTCCTTT ACCATGGGTT 
GAGGTTATAC CCAAGGAGAA TGGGCCTTGT 
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26551 GAGGTACCCC CATTATGTCC TGGAAGTGGT GAGGGGAGGG ATATACCCAG 
26601 AAQGAACmC TTAQQGAGCT CCAGCTCCCC TTCTATCCCA GACAAACCTG 
26651 AAGGAGCCrC CAAAAGATCC CACTGACCTG CCCAT7GTAG ATGTTACTCC 
26701 TTCCGGGGGG AATAGCCCAA ATAGAGTQCT GTTTCCAGCT CTCACATGTC 
26751 TTACCTCCGG GCCATGCTGC CTCCCCAGGA ATTPGTCCCA ACAAGCAGGA 
26801 TGGGCAGGTT TTCCCAAACT GTGGAAACTC GCAAGTCCTG GGTGTGGGTA 
26851 GCCTGGTACA CAGTAGGCAC CTTATAAACG TTTGTTCTCT TAATGGCAGG 
26901 CACATTTGCC TCTGGCCTTG AAGGGCTTCT GAGCTCCCAG GTCAATGTAG 
26951 TTGCTGGGGA AAGACCTGGG CGAGIGCI IC TAAGACTGGA GCAATGGGCT 
27001 TTAGAGTGTT CCTGAGCTGC TGGGCCAGCC CCCACACCTC CTCAGTCCCT 
27051 AGGCCTMGT ACCTCCACGA GCCTCTCTCT GTGGGGCTTC TCAGAGGGAG 
27101 ATGTGGAAAC TCTACCTCTA ACCTGGCTTT CTTTCCTCAT TCCCCCACTC 
27151 CACCTCCCAT AGAAACTCCC CAGGGGGTTT CTOGCCCTCT GGGTCCCTTC 
27201 TGAATGGAGC CATTCCAGGC TAGGGTGGGG I I IGI I I ICA TTCTTTGGGA 
27251 GCAGCCTGTT GTTCCAAAAA GGCTGCCTCC CCCTCACCAG TCGTCCTGGT 
27301 CGACTTTTCC CTTCTGGCTT CTCTAAGCTA GGTCCAGTGC CCAGATCTTG 
27351 CTGCCGGGAT ACTAGTCAGG TCGCCAGGCC CTGGGCAGAA AAGCAGTGTA 
27401 CCATGTGGTT TTCTGGAATC ACCGGACCCT GGTAGATTGC TGGGAAGTGT 
27451 CTGGACAGGG GGAAGGGGGA AGGGAACTGG TCCTCAATGC TGACTCTACC 
27501 AAGGGCCCTC CTAGACACTT TATCCTTTAA TCTCTCAACA GCCTAAAGAG 
27551 ATTATATATC CCCATTTTAG AGATGAGGCA AGCAGTTTGA ACAGAGTTAA 
27601 CATATQGAGC CTCACTGGGC AGCTTTTTCT GTCTTCCTGA CTTTCTCTCA 
27651 TGCTTCAGGG GGCTGCAGGT I IGI 1 1 I (_ 1 1 CTCCTAGTCG AGAGGAAATT 
27701 CTCAGGTTTG TTTTCCTCTC CTAGCAGAGA GTAAAAAAAG' GGATAGTTTG 
27751 CCTGACTTGT TCAAGGTGTG GCTGAGATTCLTTTTCTAAAG AGCCAATGGA 
27801 AATTGATCTT GAGTTTAGGA GAAAGCTTTT^ ACATGTGGAA' TTAAGATGCC 
27851 AAGTGTTGAA GTAGCCACAT TTCAGGTCCT..CATTAATTTC TCTTAATCCT 
27901 GGGAAGGCAG CTTAGGAGAA GGGI IGI ICC TTTAGGAGCC AGGAACTATA 
27951 CCCCTTTTAC CCTTCGAGAG GCAGGGAAGC CAGGGAGGAC ACAACTTCTC 
28001 AGGAAGAGGA GAAGCTAGAG CAGATAGTGA ACTCTCAACC TGAACCTTTA 
28051 AGGGCCAGAC CACTAATCCC ACCCAAGTCC ACCTCCCGTT TGTLI IGI IC 
28101 TGTCCCAGGC TTTCTCGAGA ACCTCATCTT CTTCCCCCTA CCCCCAAGCT 
28151 CCGTTTGCCC AGCTAGAGTC TGGGGGGTAC TGACTCACTT TCGTAGACAT 
28201 TCTTCCCTTC CCCAAATAAG AGGCCACATT CCTCAAGTCA CTTCTGAAGA 
28251 GATAGCTGCC ACACAGGGCT CTTTCCCCCC AGGGAGGGAC CACCCAGACC 
28301 CTCTGCTCTC CCAGGTATCC GTTACCACAT CACTACCTGG TCAGAAAGCT 
28351 GTTTCTGCCA TTAGCCCCTC CCTCTTTTAT TATAGGATAT CCTCAAGGGC 
28401 TCCTCTTTGG GCCTCAGTTT CATCCTTGGC AGAAAGTAGA AGCTAGACTT 
28451 CTTGGGCTCC TGAACAGGGT CO PGCTGGA TTCTGTGAAA CAAATTAAGT 
28501 TCTTGACCCT AGGCCTCTGG GGGAGTACAA AGTCTATGGG AGTTCTGGGG 
28551 CTGTGGTTGC AAGGAAAGTC ACGCAACCAG ATTCCATGGG GACATGATCA 
28601 GGCGTGACAT GTGAGGGAGG AAGAGGGAGC AAGGGAATGA AGAATACAAC 
28651 TTCTGTGTCC CATACACCCC TGCCTGACAG GCCATACATA CTCAGCAGAG 
28701 AATGCACTCT CTTTCCTACC ACACTAGCGT GAGGAGTGAG CTGCAATTAC 
28751 CACIGIGLII CCAAGTAAGA AAATACCTCA AATTGGAATT TACAAAAGAG 

28801 GTAAATTAGG GAGTGGCTTT TGTCGGACAT CTTTAAAGCA I I 1 I I < 1 1 1 1 

28851 TATAGAATTT CACTTAATGT CCAATACTGA TTTAATGAGC TTGGGTTTAC 
28901 ACATTATCTC TTGAAGAAAA CAAATGAACC I I IGlGl ICC AAAGCAATCC 
28951 AIGI I IAAAG GGAAAAAATT ATGCATAACT CTCCCCAGCT TCACAGTAAC 
29001 CTTTGGCAGG TGCCTTAGGT CCTCTGGGAC TCTTTTCCTT ATCTGAAAAA 
29051 TGAAGGACTT GGATCAGGTG AATQGTTCCC AGCTCTGCAA CTTATGTCGC 
29101 TCCTCAGAGG CACACAAGCT CTTTTCCATT ATTTCCCAAA TAATGGAGGC 
29151 CCTGTCTTTA ACTGCAGTAC AACTACACAA AATACTTGAA ACTACAGTCT 
29201 TCCTGGTTTT TGGTTGGAAC TGAATCAGTG CACTCTAGCA ACACTTATTT 
29251 CTTCCTGTTC GTAGGCTTCA TTATGTGTTT GGTTAATTTT TTAAAACAAC 
29301 AATAACATAT TCCATAATAA TTACAGCTTA ATTGGCAGAC IGI I ICAGTC 
29351 TATAGGATCT GCAGGAAGGA GGAGTAATAA AGGGAI I I I I GACTGAGCTC 
29401 TTATGGAACA GAGTCTCTCT AGGCCCCTGT CATATCTGCC CTTCTGGGCC 
29451 CTGGGGAAAA GTTGGCATCC CCAGTTGTGG TGCTCTCCAG GTGCCCTCAG 
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29501 GCTGTGGTGG AGGGAGCTTC CCATTCTCTC CTTCAGCCCA CTCAATTCAG 
29551 AGGCTAGGGG CTCAAAGAAG CTTCTCTACA ACTGGCTGTT GACTGGGAGG 
29601 TTAAGGGATC ACCATCCAGC CAGGCCTTCC TCAGGACATG GGAGGGCTTA 
29651 TGCTTTAACA TCTGTAAATC CACTGCAATA ATGACTGGTT CTTTTACCCC 
29701 ATAAGGTTGA GAATTTACCT GTAAACATTT TTGTCTGAAG AATTTGGATG 
29751 TMGTCAGGG CTGGGCCTCT ATCTTATCTC ACTTGGCTTC TCTCAGCACA 
29801 GCACCTTGCC TGCTTCTTCT TACACATCCT AGATGCACAG TAACTATTTC 
29851 CTAATTATTA GAAATCTATT AGAATCAATT GATTTCAGCT GGGCTTCGTG 
29901 GCTCCTTCCT GTAATCCCAG CACTTTGGGA GGCTAAGGCT GGAGGATCAC 
29951 CTCAGTCCAG GAGTTTAAGA CCAGCCTGGG CAACATAGGG AGACCCTGTC 
30001 TCTACAAAAA ATAAAAAATT AGCCAGGCAT GGTGGTGTGC ACCTGTAGTC 
30051 CCAGCTACTC AGGAGGCTGA GGCAGGAGGA TCTGTTGAGC CTGGGAGGTC 
30101 AGACTACAGT GAGCAATCAT TGTCCCACTC CACTCCAGCC TGGGTGACAG 
30151 AGTAAGACTC TGTCTCTTAA AAAAAAAAAA AAAAAAGTTG ATTTCTATTT 
30201 GGATAGATAA ATAATTCATT TTAGGACCTT TCTTTTTCAC TTACAGAAAT 
30251 CTCTTTCATT CTGGGCTGAG AAGCAGGTCC ATATPGCTAG GCATAGGAGA 
30301 AAAAGGGGTC TGTCTGCATT TGCCCTTGGT GGTCTCAAAT TGGGGAGGGA 
30351 AAGAAATGAA CACTTACTCG CTACCTTCTG TGAGCCAGGC ATCATCCAAG 
30401 ACATCTGTAC ATAATTTAAT TCTCATAACC CCATAAGATA TTATTAGCAA 
30451 TGTACAAGTG AGGAAACTCA GGCTCAGAGT CATGAAGTAA CTGGCCTTGG 
30501 GTCACACAGA TQGTAAATCG CAGAGAAGGA ATATGGATCC AGGTCTTGAA 
30551 AGAGAAAATC TCAACTGATT ATCTTTTTTA AAAAACTCAT ATGTTCTCTG 
30601 CTGACTCAAA AGGTCTCTGT GTGGATCTGG GTTCAGCCAC 7GAACTGAGC 
30651 ATCAGGGTTC CATGCACTTT GTATCTGCCC AAGCCCTCAG AACCCCTCAG 
30701 TAAIGI I I IG GAAGATGAGT TTTGGAGGTT GTCCTTAGGG ATAGCCTCAG 
30751 GGTATGTAGG CCTCTAGGTG ATCTCCCCTA ACCTGAGGAT TTCAGCTCAA 
30801 TTCACTCTGG CTCCTCAGGA CAGTGGGATG ACTGGTTCAG ACCTCAGCTT 
30851 TACCACCTCC CAGCTGGGTA CTCTTCTACC TACAGCCAGG GCAGATTTTG 
30901 ACTTTCACTT GAAACTTCCA AAAATTGAAA GGTAGAAAAA CAGCCTTGGC 
30951 TTTCGGAAGA ACGTATGATG TCCATGGCCT CTAAGCATCT GAGGTQGGAC 
31001 ATGTTCGAGT AGCACCTTAC AGTTCCAAAG TGTCTTCTCG GTTCTTTGTT 
31051 TAAAAGAACA GAGACTGCTG GGGAATTCAA CACTGTCAAG TATATGAAGG 
31101 AGGAGAATTG TGCTATTTAA CATTCAGTAC TTGGGCTAAA GGAGAAGCAT 
31151 CACGAAGTGT TAACACTCAA AGGGTCTTGA GCTGTCAGGG CTCCAGCTTC 
31201 CTTATTTTCA CAGGTGAGAA TCCTGAGGCT CAGCTGTTGA GATGTGCTGT 
31251 CTCACTCCGG TGACATAGTA CAGTGGATGT GGCTTTGCAG CCAAGCACAC 
31301 ATAGCTTCAC ATTGCAGCTC CATCAATTAT GTATTGGGCA GCTTTGGAGA 
31351 ATGATTTGAC TTTAACTCTG CTTTTCAGTC TTGTGTAAAA CAGGGATAAT 
31401 CCTGCTACGG TAGGGTTGTC AGGATTAGAG ATAATATAAA TAAGGTACCT 
31451 CATATAGGAC CTGGATTATG GCTGGCATTC AATAAATAGT AGCTGTTAAT 
31501 TGATAGCTAA GCTAGAACTC TGAAGTCTAC CATGGCAACT TCTTAAGTGG 
31551 TCTGAGAACC CAGTTGTGTT CTGTGGCAAA ACACAGCTTA GGGATCCATA 
31601 CCCAGCCCTC CTGTCAGCTC TTCACCTTCC AG I ICI I GAG AGACATGTGT 
31651 GGCAGTCACT TTGGCCACAT AGCTGGCTGT GCCCTTTAAA GGCATTCCTT 
31701 GACACAGATA TGTGGACTGG TGACGTTGCT CTCCAGCCAG GTGTTCTTCC 
31751 CAGCAGGCTC GCCTGGCTGT CTCCTGCATG CCTGTACTTG TTTGTCTCCC 
31801 TGCTCCCTCT CCTGGGCCTG GCCAGAGCTA CTTGCAGCAA ACAAAAGCAG 
31851 GATATTCGCA ATGGAAAGGA GGGTGTG1 IC TGGTGCTCCC ATGCCCTGCG 
31901 GCGCACATAC CATTGCAAGG GCGTAACAGA GCCCAGGCCT GCATTTGGGT 
31951 GCAAATAAGT CTGCACACAG AAGAAAAGAA GGACCTGGTG ACCAGGAGCC 
32001 ATGGAACCCT TGTGCTCCCC TACCTGGGCT ACTCGTTCTT GCCACTCCTA 
32051 CCATTTTCAG TTTGGAAATA I I IGI IAAGG CTTTGCTCTT CCAGGTCCTT 
32101 TGCTTGGTGC TGAGTCTACC AAGAGTAAGT GGGATCCTGT I I I IGICCTC 
32151 AGGGAGCTAA CAGTCTAGTG AAGAAGAAAG ATGGTTGCCC AGGAACTTCT 
32201 AAGTCAGAAG GCAGGAGGCA AGAAGGAAGC CCCTGCTCCT ACTGCCAGCC 
32251 CTCTGTTGGG CACCCCATAG TTCTTCAGAA CCACATTTAA TCCTCACTGC 
32301 AGGCCAGGCA TAGTGGCTCA CACCTGTMT GGCAGCACTT CGGGAGGCCA 
32351 AGGCGGGCAG ATCACTTGAG GTCGGGAGTT CGAGACCAGC CTCACCAACA 
32401 TGGGGAAACC CCGTCTCTAC TAAAAATAGA AAAATTAGCC GGGTCTOGTC 
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32451 GCATCCGCCA GTAATCCCAG CTACTCAQGA GGCTGAGGTG GGAAAAFCAC 
32501 TTGAACTCQG GAAGCAGAGG TTCCAGTGAG CCGAGATTGT GCCACTCCAC 
32551 TCCAGCCTGG GOGATAAGAG CAAAATTCCA TCTCAAAAAA AAAAAGAAAA 
32601 AAGAAAAAAT CCTCACTGCT ACCTTGAAAG TAGGTGATGA CATTGCCATT 
32651 TCACAAATGA GAAGTGAAGG GGCTAGCCCA AGATCACTTA GGTGGTAAAT 
32701 GGTGGTGCTA AGATTAGAAC CTCAGATCAT CTAGGGAAAA ACACAGATAT 
32751 GCACAGAGTT AAGGGGACCC AGGGTATTGT TTGTCCTCTT GTTTCACAGG 
32801 TGGGGAAACA ACCCAGAGAG GGAAAGGGGC TTGTCCAAGG CAATTTAGCA 
32851 CCCAAGAACT TGAACCCATA TCTCTCTCCT CCTGATTTAG AGCTCATCCC 
32901 ACATGTATCT TATATTGAGA GGAGTGTGAG CCACATACCA AGAACAGTCT 
32951 TCCCCTCTGC CTCCAACCTC ACTGTGCAGT TTTGAGACAC TTCACAGCCA 
33001 TACTCTTCAT GCCATACCCA GCCCTTAAGA CCCTGAAGTT CCCCTTCCAT 
33051 AAGACAAGTA GGAAAAGCTA TAGGGTAAAA ATAGCCATCA GTGTTTGTTG 
33101 AGCACCCAGG AGGAATTGGG CACTCCAGAA AGATAAAGGG ATTCTCAGGG 
33151 ACTTCCTTCT CTAGACTTCC CTAGCTCAGC TGCTTCAACT CATTCCTCCC 
33201 CCTCTTCTCT ACCTCCCGCA GTGCTCAGAA GTAGTAGAAC TCACTGTCGC 
33251 CTCTCACCTT GCATTGTTGA GTTTTATTTA GACTTTCTCT TCCTCAACTC 
33301 TTCATAAGCT CATGAAAGGT GAAGTAGGGT GCCCTGTGTA TTTATCTTTT 
33351 ATATCTGCAG TGCTTAGCAA GTTATAATAA TGCACTTCCC TGGCAAAAGG 
33401 CTTTCTCTCA TACATTAGCT TATTTCCTCT TGACATTGGC TCTTTGTAGT 
33451 AATAGGATGG TATTAGTTAT TTTCAATCAG AGAAAGCTAC •TAAGAGAAGT 
33501 TGTCCAGCTA GTCACAGTAA GTGGCTGATA AAGTGAGCTG CCATTACATT 
33551 GTCATCATCT TTAATAGAAG TTAAGACATA GTGAGTTTCT- ACTATATTCG 
33601 GTCTTTTTTT I I I I I I I I I I 1 1 I I I I I I IA GAGAGGGAAT CTTGGTGTGT 
33651 TGTCCAGGCT GGAACGCAGT GGTCCAATTT TGGGTCACCA GAACCTCCGC 
33701 TTCCCAGGTT CAAGCGATTC TCCTGCCTCA GCGTCCTGAG TAGCTGGGAC 
33751 TACCAGTGCA CGCCACCACG CCCGGCTAAT TTTTGTATTT TTAGTAGAGA 
33801 CAGGGTTTCA CCATGTTGGC CAGGCTGGTC TTGAACTCCT GACCTTGTGA 
33851 TCTGCCCGCC TCAGCCTCCC AAAGTGCTGG GATTACAGGT GTGAGCCACC 
33901 GCGCCCTGCC TATATTAGGA CTTTTATATA AGCTATCTCT AGCTAGCTAG 
33951 CTAGCTAGCT ATAATGTTTT TTGAGACAGA GTCTCACTCT GTCACCCAGG 
34001 CTGGAGTGCA GTGGCGTGAT CTCGACTCAC TGCAACCTCC ACCTCCTCGG 
34051 TTCCAGTGAT TCTCCTGCCT CAGCCTCCCG AGTAGCTGGG ATTATAGGTG 
34101 CATGCCACCA GGCCCAGCTA AM ! I I IGTA I I 1 1 I AGTAG ACCAGGTTTC 
34151 ACCATGTTCG CCAGGCTGGT CTCGAACTCC TGACTTCAAG TGATCCACCC 
34201 GCCTCGGCCT CCCAAAGTGC TGGGATTATA AGCATAAGCC ACTGTGCCCA 
34251 GCTGCTCTCT ATATTTTTAA TACATATTAT TTCCATTAAT TTTCACAGCA 
34301 GTTCATTTTA TAGATGAGGA AACTAGGCCA GAGAAGTAAA ATATCTTGCC 
34351 CAAGATGATC TAACTAGTAA GTGGCAGGAT CAAGATTCAA ACCAAGCAAT 
34401 GTTCAAACCT CTTGGAAGCA AGAATGTGGC CACTGTGGAA GGTGCAAGGC 
34451 CTTGACAACA AGAATAGGGA AAAGAAGGAA CTAGAAGGAA AGAGATGGCA 
34501 TCGGCTCAGC AGGCCAGGGA GCTCTTAGCT GTGTGTGTTG GGAAGCTCAG 
34551 AAGGGAGGAA GAGGTTGTCT GTGCAGGTAA GTCCTGAGAA CACACCAGAC 
34601 TTTTGAGAGG TGGAGCTTCA TAGCCAGGTC ATTAGGGGAG AAGGGAGCTA 
34651 TAGAI I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 IAG AGACGGGGTC 
34701 TTACTATGTT GCCCAGGCTG GTCTTGAACT CCTGGGCTCA AGTCATCCTC 
34751 CCACCTCAGC CTCCCAAAGT GCTGGGATTA GAGGCATCAG CCACCCCGCC 
34801 CAGCGAGCTA TGGATCTAAC ATGTACATCT TACACAGTGC TAATAGAATG 
34851 TTGGGTTTCT TCCCCAATAT TTTATTTTGA AAAAAAATTC AAATATATAG 
34901 AAAAGTTGAA AAATGTAGTT CAAAGAACAC CTACATACCT TTCACATAGA 
34951 TTCATGATTT GTTAATGTTA TGCCACTTTC TATATATCTC TCTCCCTCCT 
35001 ATCTGTATAC TTTTATTTAT TTAI I I I IGC TGAACTATTT CAGAGTAACT 
35051 TAAAGGCATC TTGATTTTAC CCTTGAACAG TTCAATATGT TTCTGCTAAG 
35101 AATTCTCCTA TATAAGTCAG ATATCATTAC ATCTAAGAAA ATTCACGGCA 
35151 ATTTTACAAT ATAATATTAT AGTCCAAATC CATATTTCCT CAGTTGTTCC 
35201 AAAAAATGTT CATGGCTGTT TCCTTTTTTA ATCTAAATTT GAATCCAAGT 
35251 TTGAGGCATT GTATTTGGTT GCTGTGTCTC TAGGG1 I I 1 1 AAAATCTGTG 
35301 CCTTTTCTTC TCCCCATGAC 1 1 I I IAGAAG AGTCAAGACC GGTTATTCTT 
35351 ATAGAATAAC CCACATTCTA GATTTGCCTG ATTAGI I I 1 1 TTATACTTAA 
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35401 GGTAI ITTTC GCAAGAACAT TACATTQGTA AQGCTCTTCG TGATGGGTCA 
35451 GTTTTGAAGA GTGGAGATGA TTAAACTGCT TTTGTTCATT GAAGTATCTC 
35501 TCAAGACCAG AGATCCTTAA CTCGTGCCAT AAATAGGTTT CAGAGAATCC 
35551 TTTATATATA CACCCTGTCC CCCACCTAAA TTATATACAC ATCTTCTTTA 
35601 TATATTCATT TTTCTAGGGG AGGU IU IG GOTTTATCA MTTCTCAGA 
35651 GGGCCCCAAG ACCCAAAGAG GTTATGAAAC ACTAGTCTGT CCACTCAGGC 
35701 AGGCAACACA GAGCTGG1 1 1 CTCGGGCCTT GTTCAGTCTG AACCAGCTTC 
35751 CCTTCGGGAG ATAGCACAAG GCTGTAACTT TGCCCCATCT TGGCTTTGGA 
35801 TCAAAGAGGA CTCTCCATTT TGI IGICATA CCTAGGAACC AGGGACAGCT 
35851 TATGTGGCCT GGTTCCAGGG ATCCAGGAGA ATTTCAGTTC TTGTCTTGCC 
35901 TTTCAGGTGT TCAGAATGCC AGGATTCCCT CACCAACTGG TACTATGAGA 
35951 AGGATGGGAA GCTCTACTCC CCCAAGGACT ACTGGGGGAA GTTTGGGGAG 
36001 TTCTGTCATG GGTGCTCCCT GCTGATGACA GGGCCTTTTA TGGTGAGTGA 
36051 ATCCCTTCAT ATCTGCCCCT CTTGGTCTTC AGAGTCCATT GACAGTGC1 1 
36101 CCAGTTCCCT GTGGCCTGTT AATCTTTTAG TCTTTCCATC AGCCAGGGCA 
36151 TCTCCCTTTA TTTATTCATT CATTCAACTA GCAGGTATCA ATTGAGCACC 
36201 TACTAAGTGA AAGGTAAGAT CCTTCCCTCA AAGACTTAAT AGTTGAACGT 
36251 TGGGAGTGGG AGGAGAGGCA GGCAGAGAGG AGACACAATA TAGT7GGATA 
36301 AGGACCTCCA AGGAGAGTGT TACAGGCTGA GAGGAGGATA TACTTAGGTT 
36351 GTCTTTAGGG AATCAGAAAA GGAGACTCTG GAATAGGCTG GCAGAGAGAG 
36401 GGGCTACCTC CTATACCTGC TCTGGACAAA CGACTTTAAG CATAGTCACA 
36451 GATTTGCCAA CCCTGTATTG GAAGAACTGA TCI I 1 1 I I AG TGGGGATGAT 
36501 TACTTCTGGG GATTTCTTCT CATAACTGAG ACCAAAACAG: TTTTGTGCAG 
36551 TCTCAGAAAT GACAGGAGGT ACCAATCTGA CACTTCGTTT- GGAAGCTCTA 
36601 GGGCAGAGAG TGAAAGAGTG GATTTTGACG . GGGGCCTTGG : TTGGAGGTCA 
36651 TTCACCCACC CCTGTCCTCA CTCCAGCAAC AGTGATAACT CACTTCCTTC 
36701 CTCCCTTTGT ACACCCTTCT CCCCACCTGC TCACAGGTGG CTGGGGAGTT 
36751 CAAGTACCAC CCAGAGTGCT TTGCCTGTAT GAGCTGCAAG GTGATCATTG 
36801 AGGATGGGGA TGCATATGCA CTGGTGCAGC ATGCCACCCT CTACTGGTAA 
36851 GATAGTGGTC CTTTGTCTAT CCTCTCCCAT ATAAGAGTGG CTGGCGGGGA 
36901 GGGACAGTGG CAGGGTGAGT TGGGCAGAAG GAGTGTTAGG GTAGTCAGAG 
36951 CATTGGATTC TTACCACAGC AGTGCTCTTA ACCAGCTCTT TAACTTGTAA 
37001 GCAGAATGAT TTACACATGT CTCTACCCTT TTTCCTTACC AACCTTGAAA 
37051 ATGTCTTCAC TCTGCCCTGC AATCGTCCCA GTGGGAGGCA CTCTTCAAGG 
37101 ACGATCCCAG AACATTAAAG TCAAAGACCC CTTAGAGCTC ACCCTGTCCA 
37151 ACCACCTTGG TTGATAAAAG AAGTCAGCCT GGGGCCCATG GAATAGAATA 
37201 GTACAAGGGC AAGGTTCTCA TTGTGAGTCA AAGGTAGAGT GAAGAGAACC 
37251 CAGACCATCT CACCCCAACC CAGGCCAGTG I I 1 1 ICCAAA TATACCACTT 
37301 GCTGCAGATC TAGCTCAGCA CCCCCAGTCC CAGCCCACCC TGAGAACCCA 
37351 GGCTCCTCAT TCTGAGCAGC CAGCTAGAAT CATGACAAAG AGGGTGGTAG 
37401 TGAGACTATG GGTACTGTTG CTTAAAGCCA CATGGTGCAG TGGTTGCTGG 
37451 GGGGCTTCTG TGTGGGACTC TAGCATCTTA TTCCCCCCTG TGCCCTCTCC 
37501 CCAGTGGGAA GTGCCACAAT GAGGTGGTGC TGGCACCCAT GTTTGAGAGA 
37551 CTCTCCACAG AGTCTGTTCA GGAGCAGCTG CCCTACTCTG TCACGCTCAT 
37601 CTCCATGCCG GCCACCACTG AAGGCAGGCG GGGCTTCTCC GTGTCCGTGG 
37651 AGAGTGCCTG CTCCAACTAC GCCACCACTG TGCAAGTGAA AGAGTAAGTA 
37701 TTTTGAGAAC CCTTCAGCAG GGGTTCTTGA GCAGAGTCTG TAAATGGGCC 
37751 TCAGAGGGCT TAGACCTCCA AAGTCTCATG CAGAACTCCC TTTATTCTCA 
37801 TCTCATATCT TTCTCCTGGA CCCCACTATG CTGTAACCGT ACCTGGGCCT 
37851 TGGCACTTAC TGTTCTCTCT GCCCAGGCTA OTCCTACCC GATACTTAAG 
37901 GCAAGAATCA CTCACCTTTC AGGTGTCAGG TTTCAGGTCA TGTTTGCTCT 
37951 TTGAAATCAT CTGGCTTGAT TATGTGTATT AG I IGI I I AT CTTCTATCCC 
38001 CTCCACTAGA ATGTAAATTC CAGAAGAAAC TTGCTGTCTT ATTCAGTGCT 
38051 GCATGCCCAG GGCTTGGAAG AGTACCTGGC ATATAGTAGG AGTTGATTGA 
38101 TTATTATTTT GTCAGTCGAG AGAATGAATG GAGAAAATGT GGTCCATGGC 
38151 CCAAAAGAAG TTAAGACCCT ATCCTAGATT CAGGCCAGAG ACCAGATGGA 
38201 GAAAGAGTCT GTGTCTATCT AATACCAGTA ATGTCGTACC TCTGGCCGCT 
38251 TACCATGTAA ATATTGATTG TGTATCTACC ATGTGTTGGA CACTAGGCTA 
38301 GTGCTTGCAC AGCAGGTGAA AGATACTAGA GTTTGGGAAG TCAGGAGGAG 
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38351 CTAAGGTCTG TTCTACAACC TTATTAGATG AAGAGGAGAG GGAATTGTG1 
38401 TCAGGGCAGA GGGAGAAGCA TTTCTCOVAA AGTAGGAGTC TTAATCATCT 
38451 CTGATCTAGG TTCAGTGTGG CCAGAAAAGG GGCTGTTAAG TATAGAGGGC 
38501 CTGGATTATG AAAATCCAGC AGATCCATTG AGAGTTTAAG CAGCAAGGTG 
38551 TTGTGACCAA GTTAACATTT TAGAAGGATC ACTGGTATCG AGGTTCGATT 
38601 GGAGAGGGGA AAGCCTAAAG GTATAGAGAC TAGTTAGGAA GCTATTCTAG 
38651 GCPGGGCATG GTG61 1 CATC CCTCTAATCT CAGCACTTTC GGAGGCTCAG 
38701 GTGGGAGGAT TCCTTCAGGC CAGGAGTTGA AGACCAACCT GGCCAACATA 
38751 GCAAGACCCC GTCTCTCTTT TTCTTAATTA AAAGAAAAGT CCAGACGTAG 
38801 ACATAGTGGC TCACGCCTCT AATCCCAGCA CTTTCGGAGG CCAAGGTCGG 
38851 CAGATTCCTT GAGGTCAAGA GTTTCGGATT AGGCCAGGCG CAGTCGCTCA 
38901 CGCCTCTAAT CCCAGCACTT TCGGAGGCCG AGGTGGGCGG ATCACAAGGT 
38951 CAGGAGATCA AGACCATCCT GGCTAACACA ATGAAACCCC GTCTCTACTA 
39001 AAAGTACAAA AATTAGCCGG GCATCGTCGC GGACGCCTGT AGTCCCAGCT 
39051 ACTGGGGAGG CTCAGGCAGG AGAATCGGGT GAACCTAGGA GGCGGAGCTT 
39101 GCTCTCAGCA GAGATCACGC CACTGCACTC CAGCCPGAGC GACAGAGCGA 
39151 GACTCCATCT CAAAAAAAAA AAAGAGTTTC GGATTAGCCT GGCCAACATC 
39201 GCAAAACCCC ATCTCTACAA AAAGTACAAA AAAATTAGCT GGGTATGGTG 
39251 GTGCGCGCCT GTAATCCCAG TTACTCAGGA GGCTGAGGCA TGAGAATTGC 
39301 TTGAGCCTGG GAGGTGGAGG TTGCAGTGAG CCCAGATCAT GCCACTCCAC 
39351 TCCAGCCTGG ATGACAGAGT AAGATGCCAT CTCAAATAAA AATTAAAAAC 
39401 AAAGTTTAAA AAAAAAATAG AAGCTATTAC CGTGATCCAG GTAAGAGATG 
39451 TGAATAACTA CAATGATGGA AAGAAGGCAG AGTTGTTAGA GATGGGAGTA 
39501 GGAGAGATGA GGGAACTCCA GATTCGGAAG ATGATGTTCA AGTTTCTGGC 
39551 TTAGGCCACA GGGTGAGTCG CAATTCCCTT CACTCAGATC GGGGATCCTC 
39601 GAAAAGGTGT TGCCTTTCTG TGTGGGTATC CTGGGCCCCT TAGGGGCCAC 
39651 TGGTGGCCTG GGACCTGGTA AACCTTCCCT GCACAAGCAG AATTGGTCAA 
39701 GCAGGI 1 1 1 1 AGGACATCTT TACCCTGCCT CAACTCTTGT CTGGCCCAGG 
39751 GTCAACCGGA TGCACATCAG TCCCAACAAT CGAAACGCCA TCCACCCTGG 
39801 GGACCGCATC CTGGAGATCA ATGGGACCCC CGTCCGCACA CTTCGAGTGG 
39851 AGGAGGTAGA GTGTGTGTCT AATCTCTCTT GTGAGGGTGG GACATGGAAC 
39901 AGATCCTCTG GGAAATCAGG CTGTAGCCTT TACCTTTTCC TACCCCCAGC 
39951 CCATCTCTTT GTCTTAGCAT TGAGCCTGTG ACCACTGGTG ACCTATTTCA 
40001 GCGTAACAGG TTCCCAGGGT AGCAGGGATG GTTCATCGAC GGGAGAGCTC 
40051 ACAGGATGCC AGGCAGAGGG CACTCTGAGG CCACTGGCAG CTAAAGGCCA 
40101 CCATTAGACA AGTTGAGCAC TGGCCACACT GTGCCTGAGT CATCTGGGTT 
40151 GGCCATCGGT GGCCTGGGAT GGGGCAGCCT GTGGGAGCTT TATACTGCTC 
40201 TTGGCCACAG GTGGAGGATG CAATTAGCCA GACGAGCCAG ACACTTCAGC 
40251 TGTTGATTGA ACATCACCCC GTCTCCCAAC GCCTGGACCA GCTGCGGCTG 
40301 GAGGCCCGGC TCGCTCCTCA CATGCAGAAT GCCGGACACC CCCACGCCCT 
40351 CAGCACCCTG GACACCAAGG AGAATCTGGA GGGGACACTG AGGAGACGTT 
40401 CCCTAAGGTC CCACCTCCCA CCCTGGCTCT GTTCTGTCCT ATCTCTGTCT 
40451 CTCGGATCAA GCTGAGCTGG CTTTCAGAAG CCTGCAGAGT TAGGAAAGGA 
40501 ACCAGCTGGC CAGGGACAGA CTATGAGGAT TCTCCTGACC CAGCTGCCCC 
40551 TCTCGGGATC ACAGTTTACA GCCAGAGCCT GTGCGGACCC AGCTCTCTGC 
40601 CAGGTTTCCT TAGAAACCTG AGAGTCAGTC TCTCTCCACT GAACTCCTAA 
40651 GCTGGACAGG AGGCAGTGAT GCTAAACCCT GAAGGGCAAC ATGGCCTATG 
40701 GAGAAAGCAT GGAGCTCAGA GCCTQGAGTA CGGGCACAGA TAGGATTCAA 
40751 TAAATTCTGT AGAAAGACTT TGAAAACAAT AAAGCAAAAG ATGAATCAAC 
40801 Gl 1 1 1 1 1 1 IA GACTTGAGGG ACCAACAACC CCCAAACCCC AGATTCTGCC 
40851 AGGTCCATGG GGAAGGAGAA GTTGCCTTGA GTGGAAGCCC CAAGTAGGGA 
40901 GACTTACAGA AAAGAAGTCA AGAGCACTGG CTCCCAGGCA GAAATACTGA 
40951 TACCCTACTC GGGCTTCAGG CTGAGCTCCT CCCTTCACAA ATCACTTCAT 
41001 CTCTCTGAGC CTGTTTCTGC ATCTGTGACA TAAGATGGTA AGATAAAGGT 
41051 GGCTCTCTCA CCAATTATGT AAGGATTAAA TGTGGAAAAG GACATAAAGT 
41101 TCTATAGTGC TGCCATAGGG ACAGTCTTCA GTAAACGTGA CACATTCTTA 
41151 GTATCACTAA GAATCAGGTT CTTGGCCAGG CACCGTGGCT CATGCCTGTA 
41201 ATCCCAACAC TCTGGGAGGC CTAGGTCGGA GGATCGCTTG AACACAGGAG 
41251 TTTCAGACCA GCCTGAGCAA CATAGTCAGA CACIGTCTCT ACAAAAAAAA 
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41301 AATAATAATA ATAATTCTTT TTAA7TAGAT GGGCAGGGCA CTGTGGCTCA 
41351 CACCTGTAAT CCCAGCACTT TCQGAQGCCA AGGCCQGAGG ATTCCTTCAG 
41401 GCCAGGAGTT CAGGAGCAGC CTCGGCCACA TTCCTGTCTC TACAAAGAAT 
41451 AAAAAAGTTA ACFQQGCATG GTCGCACATC CCTGTAATCC CAGCTACTCA 
41501 AGAGGCTCAG GAGGAGGATT GCCTGAGCCC AGGAGTTCAA GACTGCAGTG 
41551 AGCCTTGATC ACACCACTGT ACTACAGCTT GGGCAACAGA GTCAGACCTT 
41601 GTCTCCAAAA AAAAAAGTTT Gl I I I I 1 1 I I ATCCACTCTC CTCACCAAAC 
41651 AAACTGAGTA AGTTAGAGCC CTCTCAGCTG GCATGTUTTG GAAACAGTGC 
41701 CCTCTCATTA AAGTCCTCCC CTCACTCCCA TTGCCTCTTG GCCTTOGTCA 
41751 GTATCATGAA ATTAGTGGGA GGCAGGGCAA CAGAGGGCAG GGAAGAGCTA 
41801 GAAATCCATG GCCTGGAAAA GGGAAGATTT GGGAGTGGCC AGGTATCTGT 
41851 AGAGCCACCA TGCAGAGGAG GGGGGCAGCT AGCCTTGTGT GCTCTCGTCG 
41901 GCATGGTCAG CAGGAGGCAG AGCAAAAGGA CAAGGGTAAG TAAACCTGTA 
41951 GGTCGGGACA AGCCAAGAGC CATCCAGGGT CAGTCCTCTC TCGGTAGCCC 
42001 AAGTAAAGCA GGAGCATACC CCAGAGAGAA AGTTCGCAGG GCTGTTCACC 
42051 TGCAGTGCTG TGGACTTCAA CCTTCTTGTT CCTTCTTCAG TAAGTCAAAA 
42101 TAACAGTCAT TGACCATGAC TATTATCGAC CGCTTTTGAA AATGTAAACA 
42151 TAGTGACTTT ATTGCTGTAA AAATCATACG TGTTTATCAT CTTAAAATTC 
42201 AGGAAACATG GACAGGTACA AAGATGTGCA AAATATCATC CAAAATCCCA 
42251 I I rGCTGGCC AGGCACGGTG GCTCACGCCT GTAATCCCAG CACATTGGGA 
42301 GGCCGAGGGG GGCAAATCAC TTGAGGTCAG GAGTTTGAGA CCAGCCTGGC - 
42351 CAACATCGTC AAACCCTATC TCTACTAAAA ATACAATAAT TAGGCTGGGC 
42401 GCAGTCGCTC ACGCCTATAA TCCCAGCACT TTGGGAGGGG JGAGGTGGGGG 
42451 AATCACAAGG TCAGGAGTTT GAGACTAGCC TGGCCAATAT GGTGAAACCC ■ 
42501 CATCTCTACT AAAAATACAA AAATTAGGGC CGGGTGTX5GT GGCTCACGCC 
42551 TGTAATCCCA GCACTTAGGG AGGCCGAGAC AGATCGATCG CGAGATCAGG 
42601 AGTTCGAGAC CAACCTAGCC AACATGGTGA AACCCCATCT CTACTAAAAA 
42651 AATACAAAAA TTATTCGGTT G7GGTGGCAC ACGCCTGTAA TCCCAGCTAC 
42701 TTGGGAGGCT GAGGCAGGAG AATCTCTTGA ACCTGGGAGG CAGAGGTTGC 
42751 AGTCAGTGGA GATCCCGCCG TTGCACTCCA GCCTGGGCGA CAGAGTGAGA 
42801 CTCCATCAAA AAAAAAAAAA AAAAAAAAAA AAATTAGCCG GGCGTGGTGG 
42851 CGTGCACCTA TACTCCCAGC TACTTGGGAG GCTGAGGCAG GAGAATGGCT 
42901 TGAACCTGGA AGGGGGAGGT GGCAGTGAGC CGAGATCGTG CCATTCCACT 
42951 TCAGCCTCGG CGACAGAGCG AGACTCTGTC TCAAAAATAA TAATAATAAC 
43001 AATMCTAGC CGGGCCTGGT GGCACATGCC TGTAGTCCGA GTTACTCAGG 
43051 AGGCGGAGGC ATGAGACTCA GGTCAACTAG GGAGACAGAG GTTGCAGTGA 
43101 GCCAAGATCA CACCACTGCA CTCCAGCCTG GTTGACAGAG CGAGACTCTC 
43151 TCTCAAAAM AAAAAAATCC CATTTGCTCA 1 1 1 1 I PGGAT ACTAGTATAA 
43201 CTATCACTCT AAACCAGTTA GTACTTAAAT CAAGCAGATA TGGGAGATGG 
43251 TGAATTACCA TCTACAGTGT TGTCATATAT GTCACATACT GAGCATTATC 
43301 AGCTAGTAGA ATCTAGTTAA TTCTTCTATG TGTGATGTAT GCAGAGTTCC 
43351 CATTTTGAAT GTG1 1 I I IAC TATGCTTAAA TAAATGACTC ATGTCAGCAA 
43401 CCCCAAAATG ATACATCTGA TGTAAGAGCC CCTGTTCCCC AATAATAACA 
43451 TCTAAACTAT AGACATTGGA ATGAACAGGT GCCCCTAAGT TTCCTCCCTC 
43501 CAGGGTTTCT TQGCCGGTCT CTGAGGACTA CACATCCCTA CTCCCGTCTT 
43551 TCCTCATCTT CAGGCGCAGT AACAGTATCT CCAAGTCCCC TGGCCCCAGC 
43601 TCCCCAAAGG AGCCCCTGCT GTTCAGCGGT GACATCAGCC GCTCAGAATC 
43651 CCTTCGTTGT TCCAGCAGCT ATTCACAGCA GATCTTCCGG CCCTGTGACC 
43701 TAATCCATGG GGAGGTCCTC GGGAAGGGCT TCTTTCGGCA GGCTATCAAG 
43751 GTGAGCGCAG GCAACAATTG CTTTGCTCTT CTCCCCCCAG TCCCTCTGTC 
43801 ACTGTCTTTC GGGGATTTCT CATCACT7GG CCCCACCCCA CACCATGCAG 
43851 GATGCCAGGC CTCCTTCCTG GCTTTGGGTC TTGGTGTGAG AGGTATCCTT 
43901 CACCCCCACC CAGGCCACCT AAGGTCAATG TTGCTGTTAC AGTGAGCTTG 
43951 TGGACCTCGA GATCCAGGTT GGGTTGAGCT GTGCCTGTGG CCCTCCTCCC 
44001 TCCAGTCAGT GGGTGTTTGT TAGGTGCCTG CAGACCTCAG TACCGGGCAT 
44051 GCTACAAGGA GCACACAGGG GAATGGCTCC TGCCTCCCTG GTCAACAGTC 
44101 TCAGGGACTA ACCTCTCTCT TTCTCTCCTC CTCCTCCTCT TCTGCTGAGA 
44151 ACTGGGAGGG GGGGTCAGGT AAGACGTGTG TCTCAGCTTG GGGGCAGCAG 
44201 GGCTGGAGAG CTCACCCCCG ATCCACCCAG CTCCCTGGTG CATGTCTTTG 
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44251 GCACPGACCT TCCTCCCCCC AGACTTCTGT TCACTCAQGA GACTCACTTC 
44301 TATGCCAAAT GACCAGAGCC CCIGU IGGC TTQGCAGCAT CCCCTCCTGC 
44351 LI ICITCCCC ACTTCCCTTT TCPGGGTTCT TCCCTCTCCT CTGTGCATGC 
44401 CCAQCTCTCC AGGAAAGAGG GTTTGCTTCC GTGTGAGTCC CATCTTGCTC 
44451 CACGCTGCAT CTTCCACACA TGAACTCTGT CATTCTGACC GGGCTCAGTG 
44501 TCCCCTCCAA GGGATGGGAT GGCCAGCTGC ATAGATTTTC TCAAACAGTT 
44551 CTCCAGAACT TCCTCTGGTC TCAGCACCAT TAACAGTCAC CCTCCCTGTA 
44601 GGTGACACAC AAAGCCACGG GCAAAGTGAT GGTCA7GAAA GAGTTAATTC 
44651 GATCTGATGA GGAGACCCAG AAAACTTTTC TCACTCAGGT AAGAAGATGG 
44701 AGGGGGCCCG GGAGGTTCGT GTCACCATTC GAAGAGAGAA GACCTTACAA 
44751 ATAATGGCTT CAAGAGAAAA TACAGTTTCG AATTACTGTC TTAAAGACTA 
44801 AGCAGAAAAG AGCCCTAGAG GAATATCCCA CTCCCTCTAA ATTACAGCXTT 
44851 AATTATTTGT TCAATGAACA CTTACTAAAA GCAACACAAA CAGGGTACAA 
44901 GGGATGCAGT AACAAAAGAT ACAGGGTTCA GAAGAGCTCT CAGGTTATCA 
44951 GGATGATGGA CATCAAAACA CTCCAATTTA GTACAACTCA ATGTTATMT 
45001 CCTCACCTGA AGGCCCTGCT AAGGGAGCCT GGAGGGGAGC TCCCTCAGCA 
45051 CTCACACTCC TTGGGCATTT ACAGTTTTCA CTACCCCTCC CAAGTTACTT 
45101 CATCGAGTAA CTTAAGTTGG GGAGACCTCT GGTCTGGGTA TTCCCCTCCA 
45151 AGCCACTTCG CCACTCCCAC CCCAGTTCTC CCAATGCAGT TCCAAGGGTA 
45201 AGGCCTATGA AGCCATCTCC ATCTATATCG TGGTCGTCTT CCCTCATGCT 
45251 GATCTTAGTC CCCTCTCATA TCACAAGATA GGAGGTAGGA GATAGAGGTG 
45301 GTAACACTTC TCAAGCTGAT TCCTTGGAGG GAAGAGGTM GGAAGACAGT 
45351 GAGAAGTTM CCACCAGCTT TCCTTGGCTT CGCCCACCCC CAGGTGAAAG 
45401 TGATCCGCAG CGTCGACCAC CCCAATCTGC TGAAGTTCAT TGGTGTGCTG 
45451 TACAAGGATA AGAAGCTGAA CCTGCTGAGA GAGTACATTG AGGGGGGCAC 
45501 ACTCAAGGAC TTTCTGGGCA GTATGGTGAG.CACACCACCC CATAGTGTCC 
45551 AGGAGCCTTG GTGGGTTGTC AGACACCTAT GCTATCACTA CCCTAGGAGC 
45601 TTAAAGGGCA GAGGGGCCCT GCTTTGCCTC CAAAGGACCA TGCTGGGTGG 
45651 GACTGAGCAT ACATAGGGAG GCTTCACTGG GAGACCACAT TGACCCATGG 
45701 GGCCTGGACC ACGAGTGGGA CAGGGCTCAA CAGCCTCTGA AAATCATTCC 
45751 CCATTCTGCA GGATCCGTTC CGGTGGCAGC AGAAGGTCAG: GnTGCCAAA 
45801 GGAATCGCCT CCGGAATQGT GAGTCCCACC AACAAACCTG* CCAGCAGGGC 
45851 GAGAGTAGGG AGAGGTGTGA GAATTGTGGG CTTCACTGGA AGGTAGAGAC 
45901 CCCTTCCTAT GCAALI PGTC TGGGCTQGGT CAGCAGCTAT TCATTGAGTT 
45951 TGTCTGTGTC ACTGAAACTG ACCCCAGCCA ACTCTTCTCA GTTCACAGCC 
46001 GIGI I I ICAA AGAATTACAC ATCTCTAAAG GCAAACAGGG CACGGACAAG 
46051 GCAAACTCGA GAGGCAAACT GTAGCCTCAG ATGGCCTCGG CTTGCCATCA 
46101 CAGGTATTGA GGTCCTGAGG GCCCTTAGAC CAACTAGAGC ACCTCACTGC 
46151 CTAGGAAATC AATGAAGGGG AAATGAGTTC TAGCGGAGCC CTGAAGGATC 
46201 AGAATTGGAT AAAGTTCTTA TTGGCAGAGA GGCACCAGGA TTGAAGTCAC 
46251 AGGAGCAAAG ACCTQGGAGG AAAGAGGAGA AAATCATCTA TTTCACCTGG 
46301 AAAGAAATCA TTCCAAGCAT AGAAATAATA AGAGCTGACA AGTACTCAGT 
46351 GCCCTCTATA TGCTAGGCAC TGGGCTGAGG GATTAACATG CATGTGCATG 
46401 TTTATTCCTC ATGACAACCT TGGTTTCCAG ATAAGCTCGA CTGGAAAGGG 
46451 ACAGAGCTGG GATCCTGGGC TAATCAGTCT GGTCGCCAAG CCTGAGACTT 
46501 TAGCCACTGC CCTTCACATG GGGGTCCATG AAAATAGTAG TAGTCTGGAA 
46551 CAGTTTGGGG GTAGATCAAG GTCGCTGTGT TTTAAGCTAT GGAGTCTCGA 
46601 CTATAGGAGA GAAATGTAAA AGAGI I I I I I GGTTCACTGG CI I I I IGGTT 
46651 I I HIGH lb I I IGI I IGI I TGTTTGTTTC I I IGI I IGI I TTTTCCTGTT 
46701 TCTGGGGCTT GAATCAGGAA GGAGGI I 1 1 I TTC1 IGI IGI IGI I I IGAGA 
46751 AAGGATATTC CTCTGTTGCC CAGACTCGAG TGCAGTGGCA CGATCATQGC 
46801 TCACTAGAGC TTCGACCTCC TGGGCTCAAG CAATCCTCCT GCCTTAGCCT 
46851 CCGAAGTAGC TGGACTACAG GTGTGTACCA CCACACCTAA I I I I I IGAAT 
46901 1 1 1 1 1 1 1 1 CTT 1 1 1 1 1 1 I 1 1 I 1 1 1 1 1 I 1 1 1 1 GGTAGAGACA GGTTCTCACT 
46951 TTGTTGCCCA GGCCTGAATC TCAAACTCCT GGGCTCAAGC ATTCCTCCTG 
47001 CCTCGCCCTC CCAAAGTGTT GGGATTACAG TTGTGAGCCA CCATGCCCGG 
47051 GAGGAAAAGA I I I I IAAGCA AGAAAGCTTA AGAGCTGTGG I I I I ICGAAA 
47101 ATGAGTCTCG GCTQGCACAG TCGCTCATGC CTGTAATCCC AGGACI 1 1 1 I 
47151 TQGGAGGCGG AGGTGAGTCG ATCACTTGAG GTCAGGAGTT TGAGACCAGC 



FIGURE 3-16 



47201 CTGGCCAACT GGTGAAACCC CTGTTTCTAC TAAAGAAAAA AATCCAAAAA 
47251 TTAGCTQQQC GTGGTGGTGC ACGCCTGTAG TCCCAGCTAC TCAGGAGGCC 
47301 GAGGCAGGAG AATAGCTTGA ACCTGGGAGG CAGAAGTTGC AGTGAGCCAA 
47351 GATCACACCA CTGCATTCCA GCCTGGGTGA CAGAGTGAGA OTCATCTCA 
47401 AAAAAAAAAA AAAAGAGAGA CTGATATGGT TAGTACATTG GGGTGGAATG 
47451 CGGAGGGTCC AGGGAATGGA GCCCTGCATA GGGGGCTAAT GAAACATTTC 
47501 AGATTTCTGA ATTAAGGTAG TGGCTGTGGG GACAGGAGCC TGGGAGGCAG 
47551 GGTGGAGTCA GAATGGAGAG ACTGG1 IGGC AATGAGGGAA CAGGAGGAGG 
47601 AGGAGGAGGA GTTACGAGTG GCTTGAGGTG TCACTTACCA GACATTTGGG 
47651 GGATGGGGGA TAGCCGTGAT TGTTGAGCAA CTCGTTTGGG AAGAGCTAGC 
47701 ATTGATCCCT GCTGTTCTGT GCTAGCAGAA CCTATCAGCA TCTTCTGGGC 
47751 AGGAAACTGG CTCCATGAGA CTGGCTTAGG GAGAGGCTGC TAGTCACCTA 
47801 ATCTGCAGAG AAGGGGCAGC TGGAGCTGTG GGACAGAAGA GGCATCCATG 
47851 TAGCTGGTCG GGGTGTCTCA GCTTGTGAAG AGGAGATGGC TTTCAGCAGG 
47901 GCTGACACTG AAAAGGCTGG AAGAAAAAAA CAGACACACA AGAGTCTCAG 
47951 GATCAGGTAG CATAGGAAAG TTGTGGACAG TCTTTGAGGA GCACTCCCTC 
48001 AGGCAGGCAG GCAGGCAGGT CATGAGCTAT AGGGATTCAG GAAGAGCTCC 
48051 CTGGGTGTGT GAGCAGCTCC AGGAGCCTAA GGGATGAAAG TAGTATTGCA 
48101 GGGGGCTGGA GAGCAAGGAG TGGCTCCTTC TACATTTGCA AGGGAAGGAG 
48151 AAAGGAAGTT GCTCCTGAGA GTGGTAAGAG- TCAGTGGTGG AGGCCTGGAG 
48201 AGGAGACATA ACAAACAAAT TTGTTGACAA . ACATTTTGGT AGGAAGGGGG 
48251 AGAGCTTAAA GTTTAGACAG TGGGGAAGGT GGAGTCTTAG AGGAGGTGAA 
.48301 7GTCTGAAAG ACAGAGCTAG CTGGAGCAAG' AAGTCAGTTC TLIGI IGCAG 
48351 GCAGGAAGGA TCCAAAGTGG -CTCAAGCCAG . AGATTGGGAG AGTGGGGAGG 
48401 AGGGAGCAGC CTGGATCTAA GTAAAATGGG TAGAGGTGGA GGGGGTGCTG 
48451 CAACGGCCAG GGTTTTCTGA AGTTGGGGAC • ATTAGGAGAG AGCTGTGAGG 
48501 GCTTTGGCCA GCCACTGTGC TAGTGATTGG TGAACCAAAG GATGGGCAGG 
48551 AGATGGCAGC AGGGAAGCAG AGGAAGTCCA GGCTTCCTGT TGGTATTGGG 
48601 ACAAGGGAGA GGCCATAGGA GGCCCTGGCC CTGTTGTCCA GGTTGGGTTC 
48651 TGAAGCTGGG TGGGCATGGC CTGGTAGGAG AGCATCTATG GGGCCCAATT 
48701 CCAGATTCAG GGTCTAGTTG ATTTGCTGGC CCTCTAGCCT CAGCTCATGC 
48751 TTCTGTTCCA GGCCTATTTG CACTCTATGT GCATCATCCA CCGGGATCTG 
48801 AACTCGCACA ACTGCCTCAT CAAGTTGGTA TGTCCCACTG CTCTGGGCCT 
48851 GGCCTCCAGG GTCCTATCCT TCCTGGCTTC CTTGTCACAA AGGAGGCTGA 
48901 CTTGTCCCCT CTGGCTAGAG GGCAGAGGTG TTGCCTAGGA GCTCGTATCT 
48951 TTCCCTTCCT GCTTCTTCCA ATGCCCTTCT CTGTCCTCTG GGAGCTCCGA 
49001 GACACACACA GACATAATTT CACCTTCTCT CATTAGCAAC CTTTCAAATA 
49051 ATTTGATTAG AAGGGACTTC AGAAGI I IGI TGACTATATG TAGAAAACCC 
49101 TGTCATTTTA CCTGCTTTTG CCCCATAGTA GTCTTGTAAA ACAGTTCATT 
49151 GCTGACCCC4 TTTTACAGTG GTGGCACCTG AAGCCTCAGC CTGAGGCCAC 
49201 CGAGCTAGTA AATTTACAGG GACCAGTTTG AGACCAGCAT TCCTCCCACT 
49251 GCCCCTCAGC TGTGGTGGTT ACAAIGI IGI I IGICI I ACT GACTTGCTAT 
49301 CTGGCTTCCT GGGTGTCTAC CGGCTGGCCC TGGCTCTGCC CTCTAGACCC 
49351 ACACCACGCA ATCTTCATTC CTTTCCCACA TGACTGCCCT GTAGCTATTC 
49401 AAAGAGCTTG TCTCCCCCAA GTCTCCCCAT CTACTGCCTC CACCTTGCCT 
49451 TTTTCTGTCT TATCCTGGTT CTAGCCACTG CCTGAAATCA TTTTAGGAAT 
49501 AAGACAGGAC AGGGAAAAAC AAAAGCAACC CCCTGTCCCA CCTCTGAGTT 
49551 CCACTCTCCA AGTCCCTGAG CCTCACCTCC AGGGCTCCAG TGGCTCTGCC 
49601 ATGAACCCAC TGTGGGCTGG GAGTCTGCTG TGCACAGATA CCA GACCCT C 
49651 AGAAACACAA ATGCCAAGTG TGTCTGTTTT 1 1 IGI I I IGT I I IGI I I IGI 
49701 I I I I IAGATG GAGTCTCATT CTGTTTCCCA GGCTGGAGTG CAGTGGTGCA 
49751 ATCTTGGCTT ACTGCAGCCT CTACCTCCCG GGTTCTAGTG ATTGTTCTGC 
49801 TTCAGCCTCC CAGTAGCTAG GACTACAGGC GTGTGCCACC A CGCCCA GCT 
49851 AAI I I I I 1 1 1 I I I 1 1 1 I I I I TGTAI I I I IA GTAGAGACAG GGTTTTGCCA 
49901 TGTTGGCCAG GCTGGTCTTG AACTCCTGAC CTCAGGTGAT TCACCCGCCT 
49951 TGGCCTCCCA AAGTTCTGGG ATTACAGGTG GAAGCCACCG TGCCTGGCCT 
50001 GAGTGTGTCT ATTTGATAGA GCTTTCTGCT CTGATTCTCC CTTGCTATAC 
50051 ACCTTTTCTC CCCTTCTCAG TGGCTTCTCT TGCCTATGCT TCCTC CCCAG 
50101 GGCCAGGTTT GAGAACATCC CCATGAAGTC CTGACCTGTC TTTTATCCTA 
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50151 CCAGGACAAG ACTGTGGTGG TGGCAGACTT TGGGCTGTCA CGGCTCATAG 
50201 TGGAAGAGAG GAAAAGGGCC CCCATGGAGA AGGCCACCAC CAAGAAACGC 
50251 ACCTTGOGCA AGAACGACCG CAAGAAGGGC TACACGGTGG TGGGAAACCC 
50301 CTACTGGATC GCCCCTGAGA TGCTGAAGGG TGAGTCCTGA AGCCCTGGAG 
50351 GGGACACCCG CAGAGGGAGG ACAGATGCTG CCCTTGCATC AGAGCCCTGG 
50401 GAATTCCAGG GGAGGCCTGT GAAGCGTAGG ACCGGATACC CAGAGCTGAG 
50451 GATATTTTTC CCTTGCCAGG TGGGGCCTCA CGATTTAGCT CCTGAGCTCA 
50501 GGGGGCTGGG AACTGATCAG TGTCCCATCA TGGGGGATAA GGTGAGTTCT 
50551 GACTGTGGCA TTTGTGCCTC AGGGATCGCT AAGAGCTCAG GCTATTGTCC 
50601 CAGCTTTAGC CTTCTCTCTC CATGGTGAGA ACTGAAGTGT GGTGCCCTCT 
50651 GGTGGATAAT GCTCAAACCA ACCAGAGATG CTQGTTGGGA TTCTTGAAAT 
50701 CAGGGTTGTG AGGCCTCAGA AATGGTCTGA ATACAATCCA TTTTGGAGTC 
50751 TGAGGCCCAG AGAAGTTCAG TGAATTGCCT AGGAGCATAC AGCTGCCTAA 
50801 TGGCAGAGGC TAGATGAACC CTAGTCTGGT TCTTTTCCAC TTTAACGTGC 
50851 AGTTTCATCC TAGGCAGTCT TATCTTATAA GGGCTCTCCA AGGCAGTTCA 
50901 CCTACGGCTG AGGAAGGACT ATTTTCAGGT GGTGTCTGCG CAGGACAGCC 
50951 TGTGGGGTGT CCCTACAGAA CCTGTTCTAG CCCTAGTTCT TAGCTGTGGC 
51001 TTAGATTGAC CCTAGACCCA GTGCAGAGCA GGTAAGGGAT GTAAACTTAA 
51051 CAGTGTGCTC TCCTGTGTTC CCCAAGGAAA GAGCTATGAT GAGACGGTGG 
51101 ATATCTTCTC CTTTGGGATC GTTCTCTGTC AGGTGAGCTC TGGCACCAAG 
51151 GCCATGCCCG AGGCAGCAGG , CCTAGCAGCTOGCCTTCCC TCGGAACTQG 
51201 GGCATCTCCT CCTAGGGATG ACTAGCTTGA CTAAAATCAA CATGGGTGTA 
51251 GGGTTTTATG GTTTATAACG GATCTCCAC4 TCTTTGCCAC GTTCGTGTTT 
51301 CATTGGTCTT AAGAGAAGGA CTGGCAGGGT • I I I 1 1 IGM I TAGATGGAGC 
51351 CTCACTTCGT TGCCCAGGCT GGAGTGCAGT GGGAGAATGT GGGCTCACTG 
51401 CAACCTCTGC CTTCTGGGTT CAAGTGATTG*TCCTGCCTCA GCCTCCCAAG 
51451 TAGCTGGGAC TACCGGCACA CACCACCATG CCGGGCTAAT I I I IGIATTT 
51501 TTAGTAGAGA CAGGGTTTCA CCATGTTGGC CAGGCTGGTC TTGAACTCCG 
51551 GACCTCAGGT GATCCGCCTC CCTCAGCCTC TAAAAGTGCT GGAATTAATA 
51601 GGCGTGAGCT ACCTCGCCCG GCCAGGTTTT 1 1 1 1 1 1 1 1 1 1 I I 1 1 IAGTTG 
51651 AGGAAACTGA GGCTTGGAAG AGGGCAGTGG CTTGCACATG GTCGATAAGG 
51701 GGCAGATGAG ACTCAGAATT CCAGAAGGAA GGGCAAGAGA CTGTTCATGT 
51751 GGCTGTCTAG CTAGCTCTTG GGCCAAATGT AGCCCTTCTC AGTTCCCTTC 
51801 AAGTAGAAGT AGCCACTCTA GGAAGTGTCA GCCCTGTGCC AGGTACCACG 
51851 TGGACAGAGT GAGGAATCTT GGAAAGATTC CTACCTTTAG GAGTTTAGTC 
51901 AGGTGACAGC ATATCTCAGC GACTCAAACA CACACACATT CAAAGCCTTC 
51951 TGTAATTCCT ACAAAGTTGT GAGGGGTAGA GGAGAGGAGA GACAAGGGAT 
52001 GGTTAGGATA ATGAAGGAAT GTTTTGTTTT TGI I 1 1 IGI I TTTGAGATGG 
52051 AGTTTCACTC TGTCACCCAG GCTGGAGTGC AGAGGTGCAA TCTTGGCTCA 
52101 CTGCAGCCTC CGCCTCCCAG GTTCAAGCAA TCCTCCTGCC TCAGCCTCCC 
52151 AAGTAGCTGG GACTACAGGT GTGCGCCACC ACGCCTGGCT AAI 1 1 I IOTA 
52201 TTTTCAGTAG AGACAGGGTT TCGCCATATT GGCCAGGCTG GTCTCAAATG 
52251 CCTGACCTCA GGTGATACAC CCGCTTCAGC CTCCCAAAGT GCTGAGATTA 
52301 CAGGCATGAG CTACCGTGCC TGGCCATGAA GGAAGATTTG TTTTAAAAAA 
52351 TTGTTTTCTT TAATATTAAT TGAACACCTC TGTTCAGAGC ACTGGGCTGG 
52401 TGCCAGAGGG TTTCAGACAT GAATCAGATC CAGCACCTCA TAGAGCCTTA 
52451 ATCTGGCACA CACACACAGC CACAAGGAGA CACAGACAAG GCAGGGTAGG 
52501 ATGAGTGGAA GCTAGGAGCA GATGCTGATT TGGAACACTT GGCTTCTGCA 
52551 GTGAAGCCCC TTCTTAGTCC TCTTCAGTAA CCCAGCTCTC AGTGGATACA 
52601 GGTCTGGATT AGTAAGATTT GGAGAGATGA TTGGGGATTG GGGAGAGCTC 
52651 TCTAACCTAT TTTACCACCT CCTCTTCTGC CATTCTTCCT GTCCACATCC 
52701 CCAGCATCCC TTTCCCTTGC CAAGTATCTG TGGCCTCTGT AGTCCTTTGT 
52751 AAACAGCTGT CTTCTTACCC TACAGATCAT TGGGCAGGTG TATGCAGATC 
52801 CTGACTGCCT TCCCCGAACA CTGGACTTTG GCCTCAACGT GAAGCTTTTC 
52851 TGGGAGAAGT TTGTTCCCAC AGATTGTCCC CCGGCCTTCT TCCCGCTGGC 
52901 CGCCATCTGC TGCAGACTGG AGCCTGAGAG CAGGTTGGTA TCCTGCCTTT 
52951 TTCTCCCAGC TCACAGGGTC CTGGGACGTT TGCCTCTGTC TAAGGCCACC 
53001 CCTGAGCCCT CTGCAAGCAC AGGGGTGAGA GAAGCCTTGA GGTCAAGAAT 
53051 GTGGCTGTCA ACCCCTGAGC CATCTGACAA CACATATGTA CAGGTTGGAG 
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53101 AAGAGAGAGG TAAAGACATA GCAGCAAGTA ATCTGGATAG GACACAGAAA 
53151 CACAGCCATT AAAAGAAAGT TTAAAAGAAG GAAATTCACC CAAACCATTT 
53201 GAATACAGTA AGTGTATTCA TCTTTCGATA TTCCCCTGTC CATATCTACA 
53251 CATATACTTT TTTTTATAGT AAATAGTTCT GTATTTTCCC CTGCATTTCC 
53301 CTTGTCTTTA CTATCCAGTC TTCCTGTTTA TCATTTTTCT CGACAACATG 
53351 AAATTCTATT GAGAGACTGT CTGAACATAT TGTAATGTAG ATGTTCAGGT 
53401 TTTTCCAGTT TCTCTTTACA ATAGGTATTT AACTACAGTG AGCAGTTTTA 
53451 TGCATTTAGC TAATTTCTCC TTTCAGGAAG TATTTTCAAA ATTACCTTTA 
53501 TTCTTCTCAG GTAATAATTT CATTATTACC AAAGTTACCC TAGGTCTTTT 
53551 CAAGTGTGTC GTTAAAAAAC GAGAATCTGG CTGGGCGCGA TGGCTCACAC 
53601 CPGTAATCCC AGCACTTTQG GAGGCTGAGG CTGGTGGATC ACCTGAGGTC 
53651 TGGAGTTCGA GACCAGCCTG GCCAACATGG TGAAACCCCA TCTCTACTAA 
53701 AAATACAAAA CTTAGCCAGG CATGGTGGCA GGTGCCTGTA ACCCCAGCTA 
53751 CTTGGGAGGC TGAGGCAGGA GAATTGCTTG AACCCAGGGG CGGAGGTTGC 
53801 AGTGAGCCGA TATCACGCCA TTGCACTCCA GCCTGGGCAA CAAGAGTCAA 
53851 ACTCTGTCTC AAAAATGGGG TTCTTTTCCT GCCATCAAAA ATCATGTTTC 
53901 TTTTAAAAAC AAGTTCAAAC ATTACCAAAG TTTATAGCAC AGGAAATAGG 
53951 TCTTCTGTAA TCTCCCTTAA CCAATATATC CGTCAACATT CTCCTCACCC 
54001 CCAACTCCAC CCTCCCAGGA TAACCAGTTG GGACATAATC TTTATTTAAA 
54051 AATGGTTTCC GGATAGAGAA AGCGCTTGGG GGGCGGGAGC CCCGGCGGCG 
54101 GCCGCAGGGG ACAAAGGGCG GGCGGATGGG GGGGGAGGGG GCGGGGGGCG 
54151 ACCAGGCCAG GCCGGGGGGC TCCGCATGCT GCAGCTGCCT CTCGGGGGCC 
54201 CCCGCCGCGG CCCTCGCCGC GGAGCCGGGG AGCTAACCTG AGCGAGCCGG 
54251 CGGGCGTCAC GGAGGCGGCG GGACAAGGAG GGGCCCCACG GGGGCAGGTG 
54301 GCCCCGGAGG CCGCCGTGGC GGACAGGGGG ACGGGGGGGG GGGGGGGGTT 
54351 GGCGGCCCGG GCCCGGGCCC CCAGGGCAGG CAGTGGGGGC GAAGGACCAC 
54401 GCATCTACTT TCAGAGCCCC CCCCGGGGCC GCAGGAGAGG GCCCGGGCTG 
54451 GGCGGATGAT GAGGGCCCAG TGAGGCGCCA AGGGAAGGTC ACCATCAAGT 
54501 ATGACCCCAA GGAGCTAGGG AAGCACCTCA ACCTAGAGGA GTGGATCCTG 
54551 GAGCAGCTCA GGCGCCTCTA CGACTGCCAG GAAGAGGAGA TCTCAGAACT 
54601 AGAGATTCAC GTGGATGAGC TCCTGGACAT GGAGAGTGAC GATGCCTGGG 
54651 CTTCCAGGGT CAAGGAGCTG CTGGT7GACT GTTAGAAACC CACAGAGGCC 
54701 TTCATCTCTC GCCTGCTGGA CAAGATCGGG GCCATGCAGA AGC7GAGCAC 
54751 ACCCCAGAAG AAGTGAGGGT CCCCGACCCA GGGGAAGGGT GGCTCCCATA 
54801 GGACAATCGC TACCCCCGGA CCTGGTAGCA ACAGCAATAC CGGGGGACCC 
54851 TGCGGCCAGG CCTGGTTCCA TGAGCAGGGC TCCTCGTCCC CCTCGCCCAG 
54901 GGGTCTCTTC CCCTGCCCCC TCAGTTTTCC AC 1 1 1 R3GAT I I I I I IATTG 
54951 TTATTAAACT GATGGGACTT IGIGI I I I IA TATTCACTCT GCGGCACGGG 
55001 CCCTTTAATA AAGGGAGGTA GGGTACGCCT TTGGTGCAGC TCAAAAAAAA 
55051 AAAAAAAAAT GATTTCCAGC GGTCCACATT AGAGTTGAAA TTTTCTGGTC 
55101 GGAGAATCTA TACCTTGTTC CTTTATAGGC CAAGGACCGC AGTCCTTCAG 
55151 TAACACCAGT GTAAAAGCTT GAGGAGAAAT TGTCAAGCTA CACAGTATTT 
55201 GTTTTCTAAT ACCTCTTGTC ATTCTAAATA TCTTTAATTT ATTAAAAAAT 
55251 ATATATATAC AGTATTGAAT GCCTACTGTG TCCTAGGTAC AGTTCTAAAC 
55301 ACTTGGGTTA CAGCAGCGAA CAAAATAAAG GTGCTTACCC TCATAGAACA 
55351 TAGATTCTAG CATGGTATCT ACTGTATCAT ACAGTAGATA CAATAAGTAA 
55401 ACTATATTGA ATATTAGAAT GTGGCAGATG CTATGGAAAA AGAGTCAAGA 
55451 CAAGTAAAGA CGATTGTTCA GGGTACCAGT TGCAATTTTA AATATGGTCG 
55501 TCAGAGCAGG CCTCACTGAG GTGACATGAC ATTTAAGCAT AAACATGGAG 
55551 GAGGAGGAGT AAGCCTGAGC TGTCTTAGGC TTCCGGGGCA GCCAAGCCAT 
55601 TTCCGTGGCA CTAGGAGCCT GGTGTTTCCG ATTCCACCTT TGATAACTGC 
55651 ATTTTCTCTA AGATATGGGA GGGAAG 1 1 1 1 TCTCCTATTG I I I I IAAGTA 
55701 TTAACTCCAG CTAGTCCAGC CTTGTTATAG TGTTACCTAA TCTTTATAGC 
55751 AAATATATGA GGTACCGGTA ACATTATGCC CATTTCTCAC AGAGGCACTA 
55801 CTAGGTGAAG GAGTTTGCCT GACGTTATAC AACCAGGAAG TAGCTGAGCC 
55851 TAGATCCCTT CCACCCACCC CATGGCCCTG CTCATGTTCC ACCTGCCTCT 
55901 AATTTACCTC TTTTCCTTCT AGACCAGCAT TCTCGAAATT GGAGGACTCC 
55951 TTTCAGGCCC TCTCCCTGTA CCTGGGGGAG CTGGGCATCC CGCTGCCTGC 
56001 AGAGCTCGAG GAGTTGGACC ACACTCTGAG CATCCAGTAC GGCCTCACCC 
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56051 gggactcacc tccctagccc tggcccagcc ccctgcaggg gggtgttcta 
56101 cagccagcat tgcccctctg tgccccattc cpgctgtgag cagggccgtc 
56151 gqggcttcct gtggattggc ggaatcttta gaagcagaac aagccattcc 
56201 tattacctcc ccaggaggca agtgggcgca gcaccagqga aatgtatctc 
56251 cacaggttct qqggcctagt tacigicitjt aaatccaata cttgcctgaa 
56301 agcpgtgaag aagaaaaaaa cccctggcct ttgggccagg aggaatctgt 
56351 tactcgaatc cacccaqgaa ctccctggca gtggattgtg ggaggctctt 
56401 gcftacacta atcagcgtga cctggacctg ctgggcagga tcccagggtg 
56451 aacctgcctg tgaactctga agtcactagt ccagctgggt gcaggaqgac 
56501 ttcaagtgtc tggacgaaag aaagactgat ggctcaaagg gtgtcaaaaa 
56551 gtcagtgatc ctcccccttt ctactccaga tcctgtcctt cctggagcaa 
56601 ggttgaggga gtaggttttc aagagtccct taatatctgg tggaacaggc 
56651 caggagttag agaaagggct ggli icigm tacctgctca ctcgctctag 
56701 ccagcccagg gaccacatca atgtgagagg aagcctccac ctcaigi i i i 
56751 caaacttaat actggagact ggctgagaac ttacggacaa catcctttct 
56801 gtctgaaaca aacagtcaca agcacaggaa gaggctgggg , gactagaaag 
56851 aggccctgcc ctctagaaag ctcagatctt ggc1 iligi i actcatactc 
56901 gggtcggctc otagtcaga tccctaaaac attttgccta aagctcgatc 
56951 ggttctggag gacagtgtgg cttgtcacag gcctagagtc tgagggaggg 
57001 gagtcggagt ctcagcaatc tcttggtot ggcttcatgg caaccagtgc 
57051 tcacccttca acatgcctgg tttaggcagc aggtggggt..gggaagaggt 
57101 ggtggcagag tctcaaagct gagatgctga gagagatagc tccctcagct 
57151 gggccatctc acttctacct cccatgtttg ctctcgcaac tcattagctc: 
57201 ctgggcagca tcctcctgag ccacatgtgg aggtagtgga aaaggtggat 
57251 cttggctccc agagctctag gaactcttca tcacaactag; atrtgcgtct 
57301 tctaagtgtc tatgagcttg caccatattt aataaattcg' .gaatgggttt 
57351 ggggtattaa tgcaatgtgt ggtggttgta ttggagcagg gggaattgat 
57401 aaaggagagt ggttcctgtt aatattatct tatctattgg gtcgtatgtc 
57451 aaatattgta catagacctc atgagttgtg ggaccagatg tcatctctgg 
57501 tcagagttta cttgctatat agactgtact tatctgtgaa gtttgcaagc 
57551 ttgctttagg gctgagccct ggactcccag cagcagcaga ,gttcagcatt 
57601 gtgtggctcg ttgtttcctc gctgtcccca gcaagtctag gagtggtggg 
57651 cctgaactgg gccattgatc agactaaata aattaagcag ttaacataac 
57701 tggcaatatc gagagtgaaa acatgattgg ctcagggaca taaatgtaga 
57751 gggtctgcta gccaccttct ggcctagccc acacaaactc cccatagcag 
57801 agagttttca tgcacccaag tctaaaaccc tcaagcagac acccatctgc 
57851 tctagagaat atgtacatcc cacctgaggc aggcccttcc ttggagcagg 
57901 tgtgactgac tatgaccttt tcctggcctc gctctcacat gccagctgag 
57951 tcattcctta ggagccctac cctttcatcc tctctatatg aatacttcca 
58001 tagcctgggt atcctggctt gctttcctca gtgctgggtg ccacctttgc 
58051 aatgggaaga aatgaatgca agtcacccca ccccttgtgt ttccttacaa 
58101 gtgcttgaga ggagaagacc agtttcttct tgcttctgca tgtgggggat 
58151 gtcgtagaag agtgaccatt gggaaggaca atgctatctg gttagtgggg 
58201 ccttgggcac aatataaatc tgtaaaccca aaggtgtttt ctcccaggca 
58251 ctctcaaagc ttgaagaatc caacttaagg acagaatatg gttcccgaaa 
58301 aaaactgatg atctggagta cgcattgctg gcagaaccac agagcaatgg 
58351 ctgggcatgg gcagaggtca tctgggtgtt cctgaggctg ataacctgtg 
58401 gctgaaatcc cttgctaaaa gtccaggaga cactcctgtt ggtatctttt 
58451 cttctggagt catagtagtc accttgcagg gaacttcctc agcccagggc 
58501 tgctgcaggc agcccagtga cccttcctcc tctgcagtta ttcccccttt 
58551 ggctgctgca gcaccacccc cgtcacccac cacccaaccc ctgccgcact 
58601 ccagccttta acaagggctg tctagatatt cattttaact acctccacct 
58651 tggaaacaat tgctgaaggg gagaggattt gcaatgacca accaccttgt 
58701 tgggacgcct gcacacctgt ctttcctgct tcaacctgaa agattcctga 
58751 tgatgataat ctggacacag aagccgggca cggtggctct agcctgtaat 
58801 ctcagcactt tgggaggcct cagcaggtgg atcacctgag atcaagagtt 
58851 tgagaacagc ctgaccaaca tggtgaaacc ccgtctctac taaaaataca 
58901 aaaattagcc aggtgtggtg gcacatacct gtaatcccag ctactctgga 
58951 ggctgaggca ggagaatcgc ttgaacccac aaggcagagg ttgcagtgag 
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59001 GCGAGATCAT GCCATTCCAC TCCAGCCTGT GCAACAAGAG CCAMCTCCA 
59051 TCTCAAAAAA AAAAA (SEQ ID NO: 3) 
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Context: 



DMA 

Position 

941 C^CTMCTGjGTGCTG^GGTTAC 

MQGnnT3GCTCTAAMTMTGAGATCraCTQQQG^ 
ACCCTCAMGCTCTrACATTrrMGAGTTCCAA 

ttggaigi i rcrci ! CATTAAAATCTCTCACTAATTCA I igiu igiccactgtccgtaa 

CCCMCCTGGGATTGGTTTCAGTGAGTCTCT 
[A,T] 

GATQGCATACTOXnXiACGVCTCrCACCCTAAMCCAAAAAGGGCCCTCTr^ 
TCTCAGGATTTTAGACCCAGGMGMTGAGTlGATQQQGAW 
GCATX^GMGAGreGAATCGGreGGTrGAQGTCGTCTT^ 
TTMCTCTTCTCTCGGGMCGAGGGGG^ 

TTGAGCAATCTTGAAGreCCAQGAGL I G I GC I 1 1 G I CTATTCATQGCCCCTCreCCTCre 

2612 TGAGTTQGMCAGTTTCATACCAAAACCATCCCCCCGCCCCCCMCCC^ 
. CCGrQGAAAMTTQGCCCCTXXJTGCCAAAMQCJrTCAGGACTGO 
TTTATTCAA IGI HGG1 I (aAOTAMT^GCTCTTQGATTAQGTXjATQGAAAMTCrGAAAA 
MCAQQQC7TTreAQGAATAGGAAAAGGCAGTMCIAT(nTTM 
GGCTGTTGXjCTGX3G7\ATAGTCATAGGAAGGXCT 

[G,A] 

1 1 IU ICI I CTCAGAGCTATAAGCAMGGCTGAMGrrCTAGAAAAAQGCAAG I HIGH 
TCAGTAGAAAAAAGGATAATCAGAACCAI I I I I AGAAMTGXW\TGAGACTACTrTTGAG 
GCCATGAGTTCCTTCrCCCTTjGAGAGATGAGCAGAQGrrQGACAAG IGCI I ACCAGAGAT 
CTTGTGGAGGCAGAMCTCreCATCTAGCAGAGC^^ 
GCTGTTMCTCAGTCrrATTCTACATCGTAGGAATCCT 

5080 ACAAOGTAAAATAGTTGAAA I I IGI I GG I GGAMGAAGAGCAGTCCACTCCAGAQGCTQG 

ATGGGCATCCCTCGXICCCCAAGGTCTCAAGTGGTAGGG^ 

A(^TAGACTAGGCAGGG^CCTTGT^ I IGI IG 

TAAMCATCCOUTGCXrATACCAAGTMTTG^ 

OXKXSMCCATATAGGGGATTGGCCTCGAGAO^ 

[G,A] 

CCATC^TTATTATCCTTTCCTTTC^GCTATMCTCAGAGCrCTCAAGTL I I I ICTGTGGA 
TCTTATTCCU IGGI ICI I GCCCCTTTTACrCCCAGGGMGTTGATTCTGTL I I I ILIGI 
TCG^TTTAGTATGAOVGGAGOVGAGAATGTO\GAGCTGTM 
OTTGGCTGGTCCmOXTTTTATAGXTGGGACrMTM 

6599 CTGTMTCCTAGCACTCTGGGAGGCOGAGGCAGAAGGATCGCTTG^ 
GAGTTTGAGACCAGCCTGGCCAACATGXiCAAMCTCC^CCTC^^ 
ATTAGCCAGGOGTGATCGCACACACCTGTAGTCCCAGCTACITGGGA^ 
TGATTACCTCAGCCCAGGGATATCAAGGCT&TAG^ 
CATCCAGCTGGGGGACAGAGTGAAACCCCTGT^ 
[-,A,C] 
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CCTTMTAATCAGTAACTGTCACnTATATTA lb I lb I GAGTGTGTCTCTATATACACCT 
ATATGTATACATTTCTCTTATTACACAT^^ 
TAAGGGCMCTTTGAACTACCCTGACACMTCAAGC^ 
TAGAGTATCTAGGTTCTCTCTCCTAGTTGCAG^ 

CAGC7GTGCTGAAGGAGCACATCTCCTCAC 1 1 C I GAGCTTTCCCCTGGTAAATTCAAACT 

6983 ttCATT^TTGGTWCTCATGTGG^ 

GACACMTCMGCGW\TATCATTCCCGT^GAGGMGTAGAGTATCTAGb I IUGICTCC 
TAGnTKAGCrTTACaTGAQGAO^GAGACTCTMTCCAGCKs I GC I GAAGGAGCACATC 
TCCTGAU I (_ I GAGCTTTCCCCT37TAMTTCAM(^ 

GAGCCTCGTMTTTGCCCTGGGGAGAGTGAL I G I C I 1 1 1 GGATCTAATTTGAL I 1 1 IGCC 
[C,G] 

CAGTTCGV\GGAAMTCTrCAGGGCTAGGAAGGATTGTA 1 1 RG I L IGACCCCAGAGATAAC 
CraGGTTmaGGAACATOiGGCAT^ 

AGCTTGCCAGTGTTTCTCTGATGAATTTAGAGTACCTGAGTAGTGC^ 
GAGGACTCTCCCTCTXJTXKTACrCAGAGAMTTCATTCTrCAAG 
GCTCTTACCC^GCTCGGCTACAGTTACMTAAAGGAAATGAC 1 1 1 ICI ICTCCCCTTCCC 

9885 GGCGTGCCACCACACCTPGCOM I I I I 1 1 I I ATTTTAAGTAGAAACAAGGTCTTATTAAT 

ACTATGTTGCCCAGGCTGGTCTTGMCTC^ 
GCnGGGATTACGGAAGTAAGCGACTGTGCOGGCCAGTGO^ 
MOVGGMGGCCCAGAAAGGTTTGGAGTMCrTGTCC^GGGTCACACAGATC 
ACrCAGGTCrCCCTGGCTCCCAAGAGAGTL I GL I I I CCACTAGGACTCCCAGGAGAAAAA 

v., [a,-] ' • ' ■ 

j r: AAAAAAAAAMCAGTAGACTTGGAGACAGAAMTCTGATTTGAGTCr^ AGTTCAGCTAGG 
CrMCTGTGTMCTGTGGGCMGTTCOTTAGGCCCTGTGAGCCTCAG I MLI IATCTGTA 
AMTGTCATAAAAGAMTCCATCTCATGGAGTAGTTC^^ 
ATTAGMTGGTTTMTGTGAAGGATTAGCAGCAGCACATCGCMCA 
TTMCTATCCAMTATATCMGCGTGATTTGCTATATATAAMGTCATCAAATTAQ 

12538 ACrTGGGAGGCTC^GGCAGGAGMTG^G 

CAGATCACGCCACTGCACTCCAGCCTGGTGACAGAGTAAGACTCCATCT 
AAAAAAAAAAAAATTCCTTMTTTGGCCTACAGTAGAGCCCTCCGTMTCTG 
CCACATCTCCACAACCTCCTGCTCCCTGCACTTCAGCCTCACCTCTCrTCTGGAC^ 
CTCCTTCTGACAAGGGC I I IGI I CATTCTGCTCCCTCTGCCTAGAATGCCCCCTTACTCT 
[G,T] 

TTCACTTAACTCL I GL. I I ATCGTTTAGATCTTTACCTGGATGGCTCAGAGAAATATAGAA 
GTAATTCCTCACCCTGAAAAATAGGTTAGGTCCL I G I I I IAIGI I I I CATAGACCTTTCC 
TTTGAGGU I 1 1 1 1 I AAAAAAGTAGI I 1 1 MTCTCACATTTATTCATGTGATCATCTCCT 
TMTGATATCTTAAGACCTCTAATAGMCMTTTGGTCATGGACTGTGGGG I I I I IGCCC 
CTCATTGTGTCAGCACTGAGCATATTGT7GGCATAGGAGGGATA I I IGI I GAATGAATTG 

17707 GTAGTGGGTGCTCAGAGTGTTTGCTGGGTGAATGATGTA I I IGI I GAACGACTCTTTGGA 
CACTTGAATAMGTCCATCCAGTATGCACCATTACCATCTCTTCGCrCTACAATATTC^ 
TTAGGCAAGAGCTTATL I I I I GAGGTGATAAGATAAGCTCAAACTTATGTAGACTAAGAC 
CTCAGTCTGTAMTGTCATCCCTMGTCTTAMCCATCAAMCCAGGGCCTCMGGAATG 
GCATGCLI IUGCAACTGTAGCAACCTCUGIGU I ATTTTGCCGTGI I I I ICATTTTTC 
[T,C] 

CCCAAAAGCTAGAGTCCCTTCTCCCATGGGCAGTGCTGGAAGTGTGCTMCAM 

CTCCATAUGCI I ACGATTACAAAAAAMCCCTCAGCATCTCATGCCAGACTTGAGTTAA 

GGIIGIIIIUII IGIGI GTCAGCTGTATTCTGGTCATGACTTCCTGATGATGCCCTATA 

GAGATTTTGCTGAGAT(^GAGGGTGCTCCACTGCCATCAGTAG^ 

GCACGGI I I L I GAAGTTGGCTAATGTCATCCCTCACG I I IGI I IGI I IGAAAI I IGI I I I 

18219 TGCCATCAGTAGGACTGACTCTTGCAGAAGCACCGI I I L I GAAGTTGGCT AATGTCATCC 
CTCAGGI I IGI I IGI I IGAAAI I I GI 1 1 I AGTTCCAGAGATAGCACTTTCATGGAATGAC 
GCTATCTTCTAGAATCAC I I I I I 1 1 I 1 1 I 1 1 1 1 GAGTTGGAGTCTCGCTGTGTCGCCAGG 
CTGGAGTGCAGTCGCACAATCTCAGCrCACTGCAATCTCCACOT 
TCCCCTGCCTCAGCCTCCCGAGGAGCTGTTACTACAGGCGCACACCCCCACTCCrGG 
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[-,A] 

TTTTAIGIGI 1 1 I AGTAGAGACGGGGTTTCACCG I G I IGGCCAQGATCGTCTOGATCTCC 
TGAL Midi GATCTGCL HOG I OVGCCrCCOVAAGKKrreQGATTACAQGTCTGAGrGVC 
CGCGCCPGGCCTAGAATCACL I I I I I ATACCATAACGTGAGCACCACTGCCGQJTCACG^ 
AGGAAAGAGAG^QGCAGCTACTUTTGGGGrrTACAAATCGGTM 
GAMCTCTCTACITAGCCAAGGCrrMOW^ATGrCAATCACO 

19670 GACCCCCATOtfT^GCMCTATAGttC^^ 

ATGTCAGTTTACAGAQQGL I I I I G I ACTCATCATCTAGTTTAGTTCCTGCAACAACCTC 
TTCAGGAATATAGCACMGCAGX^CMG^ 

GrTTATGCreCraOGAAGGGCAGCACreAMTTAAMGAAAA^ UNCI GAGCTCAAATC 

CCATCCCCTITCCTCAA7UTCAGCTCT 

[C,T] 

AGAGCCTCAMTTGCTGGGTATGrnSACj I I L I I GTATCTGA 1 1 I I I CTAGATTTCCTGCC 

CAO^TTUITACTGTCreGATATOVQGAAAGAGTTrATO 

TAAGGTCrCATGATGAGTMCCCAGreAAMCATCAAGT^ 

ATTTCACTACTGCTGACTCCTX^TC^^ 

TTCCATCATCKKZCTAGAATTTATCTCAAGGMTCAMGC^ 

21153 GGACCCTTCTTTTAGAAGG^TG^^ 

AQGAGTQQQGCA<XiAMGATCGTTAGrAGATXKK5QGroCTM 
TCG^GXXXrCGX3AGTCCrCAAAMTTCrC^ 

GGGCTTCACACAAACA(j I I ICI IGGGI I I IGAAI IGl I I GACCAGAGL I I ILI ICCGACA 
AMGGrTGGGGTR^TTCATTCACrrACCA.CACCrrGCCK^ 

;-T..,7- ' [G,T] 

GTTATCA^GGCTATTCTrTCTCG^GC^ 

GGTTCTAAGGAGTCACj 1 1 IGl I CAGCTCCGTX^^GGFTTCCA^CTTATXWVATGTGCre 
GAGATTMCACCTCrCCTGCCATTTTATCCCrACTATMTT^ 
CAGTTGCCTClTiGCAGCCATAACTXaATCAA I G 1 1 C I GCCAGCTCXTCTGAGGACCTAGAA 
G7\GCAGTTTTOATCCAGGACCAGTTC^ 

24566 CT ACTCreGAGGCTG^GGTGAGAGGATCACITGAGTCCAGAAGGTCGAGCT 
AGTGAGCCATT^TCGCATCACCGCAC^^ 
AAAAAAAAAAMG*AAAAAAAAAMCACCCXG\C<^^ 
TAGTX^G^TMCCCCTC^GMCCTATTTCCTMTCTGTTAMTG^ 
CTCCTTTTACTCGCAATTTAMCATGA^ 

[c,-] 

TAGA^TATTMCTCCTCAATAAATCGTAGCl ICI I AACAGTATTCAAACCCATGTGCT 
CTTATCACATGCA I IGl IGICCCIGIGICCAGI I GG I GGAATXS3GAAAAGGCTCCCT7GT 
MCCCCATCrACCATCrrrATCAGAOTTTCCTXKICATQGTTCACAOT 
TGCACGGTGACTTCTCGCTCTTTACAATGGTXSAGC^ 

GATGTCACKXiCCCAAATCCAGTAGTCAGATCTCAG I G I IUGGI I I CCTCCAGCAGCCT 

26604 GATTTCCAGCT^GCCTGTCTATCTGGTCTGGGA^GAA 

CCGGCTTACTTCACCTCCAGAGACL I G I I I CGGTGAGTTGGTCTCCGAGTTCCCCTCTCC 
ATCTCTCCTCGCCCCKCT^ 

GTCCTTT ACO\TCGGTTCTGCCGGGCAGMGCCAGCGGAGGTTATACCCAAGG^GAATCG 

GCCTTGT^GGTACCCCCATTAT^ 

[G,A] 

AALI ILI I AGGGAGCTCCAGCTCCCCTTCTATCCCAG^CAMCCTGAAGGAGCCT 
AGATGCCACTGACGTKICCATTXrrAGATtjTTAC. I G(_ 1 1 CCGGGGGGAATAGCCCAAATAG 
AGIGLIGI I ICCAGCTCTCACAIGICI I ACCTGrGGGCCATCCTGCCTGCCCAGGMTTT 
GTCCCMCMGCAGGATGGGCAGG I I I I GCCAMGTJTCGAMCTXKaCMGTCCrGGGTC 
TXSGGTAGCCTGGTACACACTAGGCACCTTATAAACG 1 1 IGl I CTCTTAATGGCAGGCACA 

27255 TGGGGAMGACCTO^GAGTGXnTCTMGAaGXy\ 

AGCKOGXiGCCAGCCCCCA(^CCTCCTCAGTCCCTAGGCCT^ 
CTCTCTGTGGGGCTTCTCAGAGGGAGATGTGGAAACT I ILI I I 

GXTCXTTGXICCCACTCCACCTCC^ 
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CCU ICIGV\ATGGAGCCATTCCAGGCTAGGGTGGGGI 176171 I CATTCTT7GGGAGCAG 
[C,G] 

CTGTTGTTCOWWXGGCTGCCTC^^^ 

GGCTTCT CTAAGCTAGGTCGVGTGCCCAG^TCI TGC I GCCGGGATACTAGTOVoGTCGCC 
AGGCCCTCGGCAGAAAAGCAGTGTACCA I G I GG I I I lUGGAATGACCQGACCCTOGTAG 
ATTGCTCGGAA&TOTCTGGACAGGGGGMGGGGGAAGGGA^ 
CTACCAAGCGCCCTGCTAGACACTrTATCC^^ 

27399 AGA7CnX3GAAACTCTACCrCTAACCTGG LI I ICI I 1 bCTCATTGCCCCACTCCACCTCCC 
ATAGAAACTCCCCAGGGG b 1 1 I C I GG CCCTaGGGTCCCI ICIGMTGGAGCCATTCCAG 
GCTAGGGTGGGG I I IGM 1 1 CATTCTTTGGGAGCAGCCTG I IGI I CCAAAAAQGCTGCCT 
CCCCCTCACCA&TGCTCCTCGrCGACTTTTCCL 1 I C I GGC1TCTCTAAGCTAGGTCCAGT 
(XICC^GATCTTGCTGCCGGGATACTA 
[T,C] 

ACCATGTGGI I I I G I GGAATGACCGGACCCTGGTAGATTGCTGQGAAGTUrCn5GAC^QG 
GGGAAGGGGGAAGGGAACKXTTCCTCAATGCreACTCTACG 

TTATCCTTTAATCTCTCAACAGCCT AAAGAGATTATATATCCCCATTTTACAGATGAQQC 
MCCAGTTTCMCAGAGTTMCATATQGAGCCTCACTQQGCAGL 1 I 1 1 ICIGICI ICCTG 
ACTTTCTCTCATCCTTCAGQQGGCTGCAGG I ITGTTTTUI I CTCCTAGTGGAGAGGAAAT 

28088 AAGAGCOXATGGAAATTGATCTTGAGTTTAGGAGAA 
GCCAAGTGTTGAAGTAGCCACATITCAGGTCCT^ 

CAGCTTAGGAGAAGGGI IGI I CCTITAQGAGCCAGGMCTATACCCCTTTTACCCTTGGA 
7* GAGGCAGGGAAGCCAGGGAGGACACMCTTCT^ 

TGMCTCTCMCCreMCCTTTMGQGCCAGAGCACTMTGCXIACCCAAOT 
[G,A] 

TTTGTC I IGI I CTGTCCCAGGCTTTCTCGAGAACGTCATC I ICI I GCCCCTACCCCCAAG 
CTCGGTTTGCCCAGCTAGAGTC^ I ICI ICCCT 

TCCCCAMTAAGAGGCCACATTCCTGAAGTCAC 1 1 C I GAAGAGATAGCTGCCACACAGGG 
CTCTTTCCCCCCAGGGAGGGACCACCCAGACCCTCTCCTCTCCCAGOT 
ATCACTACCTGGTCAGAAAGC I G 1 1 1 L I bCCATTAGCCCCTCCCTCTTTTATTATAGGAT 

28734 AAGTAGAAGCT AGAC I ICI I GQGCTCCTGMCAQQGTCCTTGGTQGATTCTXnGAAACAA 
ATTAACil ICI I CjACCCTAGGCCTCTCQGGGAGTACAAAGTCTATGGGA(j I ICIGGGGCTG 
TGGTTGCAAGGAAAGTGACGCAACCAGATTCCATGGGG^ 

AGGGAGGAAGAGGGAGCAAGGGAATGAAGAATACAAC I ICICIGTCCCATACACCCCTGC 
CTGACAGGCCATACATACTCAGOVGAGAATGCAC I G I C I I I CCTACCACACTAGCGTGAG 
[G,A] 

AGTGAGC7GCAATTACCAC HGTGC1 ICO\AGTAAGAAMTACCTCAAATPGGAATTTACA 
AAAGAGGTAAATTAGGGAGTGGC 1 1 I I G I CGGACATCTTTAAAGCA I I I I ICI I I I I ATA 
GAATTTCACTTMTGTCC^TACTGATTTMTGAGC^ 

AGAAAACAAATGAACCI I7GTG1 I CO\MGCMTCCATGTTTAAAGGGAAAAAATTATGC 
ATMCTCTGCCCAOCTTG^CAGTM 

29246 MTCCATGTTTAAAGGGAAAAMTTATGCATMCTCT^ 

GCAGGTCCCTTAGGTCCTCTGGGACTCTTTTCCTTATCTGAAAM 
AGGTTW\TGGTTCCCAGCTCTGCMOT I I I 

CCATTATTTGCCAAATAATGGAGGCCC I G I C 1 1 I MCTGCAGTACAACTACACAAAATAC 
TTGAAACTACAGTCTTCC I GG I 1 1 1 IGGI I GGMCTGMTCAGTGCACTCTAGCAACACT 

[-,T] 

Al I ICI IGCIGI ICGTAGGCTTCATTAIGIGI I IGGI IAAI I I I I I AAAACAACAATAAC 
ATATTCCATMTMTTACAGCTTMTTGGCAGACTGTTTC^ 

AGGAGGAGTAATAAAGGGAI 1 1 1 1 GACR^GCTCTTATGGAAC^GAGTCTCTCTAGGCCC 
CTGTCATATCTGCCC MCI GGGCCOGGGGAAAAGTTGGCATCCCCAG I I G I GG I GCTCT 
CCAGGTGCCCT^GGCTGTGGTGG^ 

29490 AACTACAGTCTTCC I GG I I I I IGGI I OGAACTGMTCAGTCCACTCrAGCMCACTTATT 
TCI IGCIGI I CGTAGGCTTCATTATG I G I I IGGI IAAI I I 1 1 I AAAACAACAATAACATA 
TTCCATMTMTTACAGCTTMTTGGCAGAC I G I I I CAGTCTATAGGATCTGCAGGAAGG 
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AGGAGTAATAMGGGA 1 1 1 1 I GACTCAGCTCTTATGGAACAG^GTCPCTCTAGGCCCCTG 
TCATATCTCCCU I C I GQGCCCKXSQGAAAAGnXKaCATCCCCAGnUTXXnTOCrCCA 
[G,A] 

GTTCCCTCAGGCTTnGGTCGAGGGA^ 

AQGCTAQQQGCTCAAAGAAGCTTCrcrACMCnXXrTUrrcACT^^ 
ACCATCCAGCCAGGCCTTCCT^GGACATO^^ 

CACTCCAATAATGAL I GG 1 1 LI I I I ACCCCATMGGlTGAGAATTTACCTGr AAACATTT 
TTXTTCTGAAGMTTTCGAlXnAAGTCAGOGCroGGCCT 

29934 GGACATCQGAGGCXXTATTXiriTMCATCTCTA 

TTACCCCATAAGGTTCAGMTTTACCTCTAAACA I I I I ICilClbAAGAATTTCGATGTAA 
GTTC^GGGCTCGGCCrCTATCTTATCrCACrrcGCTTCT 

TTGI ICI lACACATCCTAGATGCAI^O'MCTATTTCCrMTTATTAGAAATCrATTAGA 
ATO^TPGATTTCAGCTOQGLI I GG I GGCTCCTTCCTCTMTCCCAG<^CTTTCGGAGGC 
[T,C] 

AAGGCTOGAGGATCACCTC^GTCCAGGAGT^ 

CCPGTTCTCTAG^AAAMTAAAAMT^^ 

CTACTO\GGAGGCPGAGGCAGGAGGATCT^^ 

MTGATTCTGCG^CTGCACTC(^&^ 

AAAAAAAAAAMCrrR^TTTCTATTTCGATAGATAAATAAT^ 

34480 CTOXCTTCAAGTGATCCACCCGC^^ 

CACTCTXKCCAGCTXXTCrCTATATTTTTMTACATAT^ 

AGTTCATTTTATAGATCAGGAAACTAGGCCAGAGAAGT AAAATATCTTCCCCAAQ\TGAT 

GTMCTAGrAAGnK50\QGAT<^GATTCAMCO^ 

MGAATGTGGCCACTGTCGAAGGTCCAAQGCCTTGACMC^ 

[A,G] 

CTAGMGGAAAGAGATGGCATGGGCTCAGC^GGCCAGGGAGCTCrTAG 
GGAAGCrCAGAAGGGAGGAAGAGG fTCTCTCI (jCAGGTMGTCCTGAGAACACACCAGAC 
TTTTX^GAGGTGGAGCrrC^TAGCCAGGTC^TTAGGGGAGMGGGAGCT 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 AGAGACGGGGTCTTACTA I G I IGCCCAGGCTC 
GTCITGMCTCCTGGGCTCMGTGATCCrCCCACCTCAGCCrCCCAMGTG 

38812 AMTCCAGCAGATCCATTGAGAGTTTMGCAGCAAGG IGI IGI GACCAAGTTAACATTTT 
A6AAGGATCACTGGTATQGAQGTTCGATTCGAGAGQQGAMGCCTAAAG 
AGTTAGGMGCTATTCTAG&Ott^TGGTT^ 
GAGGCTGAGGTGGGAGGATTTXXTCAGGG^GGAG^ 

CAAGACCCCGTCTCIGI I I I I (J I AATTAAAAGAAAAGTCCAGACGTAGACATAGTGGCr 
[T,C] 

ACGCCTGTAATGCCAGCACrrTGGGAGGCCAAGGTCGGCAGA I IGU IGAGGTCAAGAGT 
TTGGGATTAGGCCAGGCGCAGTTXCTCAGGCCTC 
GTO3GGGGATCACMGGTCAGGAGATCMGACCATC 
CTCTACTAAAAGTACAAAMTTAGCGGG^ 

TCGGGAGGCT^GGCAGGAGMTGGCGTGAACCrAGGAGGCGGAGC 1 1 GCI G I GAGCAGA 

40731 G 1 1 U G I CCTAlXn-OGTCrCTCGGATGAAGOGAGrrGGCTTT 

TAGGAAAGGMCCAGCreGCCAGGGACAGACTATGAGGATTXH^^ 

TGTGGGGATCACAGTTTACAGCCAGAGXZCrcTXKjGGACCCAG 

TAGAMCCTGAGAGTCAGTCTCTUrCCACreAACrCCTMGOG^ 

GCTAMCCCTGAAGGGCMCATGGCCTATGGAG7\MGXATCG7VGCT 

[C,G] 

GGGCACAGATAGGATTXW\TAMTTGTGTAGAMGACrTTGAAMCM 

TGAATGAACG I I I I 1 1 I I AGACTTCAGGGACG\ACAACCCCCAAACCCCAGATTCTGCCA 

GCTCCATCGGGAAGGAGMGTTGCCTr^GTGGAAGCCCCAA 

MGAAGTO\AGAGCACTGGCTCCCAGGCAGAMTACTGATACCCTAC7GX 

TGAGCrCCTCCOT^OW^TCAOTCATCrCrCTGAGCCrG I I ILIGCATCTGTGACAT 

41303 CTCTCAGCUGI I I CTGCATCTGTGACATMGATGGTMGATAMGGTGGCTtrrCrCACC 
MTTATGTMGGATTAAATXjTGGAAMGGAC^TAMGTTCT 
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AGTUTTC^GTAMCGTTGACACATTCTTAGrATO\arAAGAATCAGGI I CI IGGCCAGGCA 
CCJGTTSGCTCATGCCTXJrMTCCOVACACTCTQQGAQQCCTAGOT^ 
CACAQGAGrrTTykGACCAGCCTCAGCAACATAGTGAGACACTCTCT 
[T,A] 

MTAATAATA A I l b 1 1 1 1 1 M TTAGATCGGCAGGGCACTXjTX5GCrCACACCTCrAATCCC 
AGCACTTTCGG^GGCCAAGGCCGGAQGA I IbU I LAGGCCAGGAGTTCAGGAGGXGCCTC 
GGCCACATTCCTCTCTCTACA^ 
CTMTCCCAGCTACTCA^GAGGCTCA^ 

TGC^GreAGCCn^TCAOVCCACTGTACTACAGCTTGGGCMCA^ 

41305 OTttGCaUriTCnXj*^^ 

TTATCTMGGATTAMTCTCGAAAAGGAC^TAAAGTRrrATA 

TGFTCAGTAMOGK^CACATTGTAGTATCACTAAGAATCAQLi I I CI I GGCCAQGCACC 
GTXsGCTCATCCCTXjrMTCCCMCACrcroQGAQGCCrA^ 
CAGGAGTTTCAGACCAGCCreAGCAACATAGTTGAGAC^C^ 
[-,A] 

TAATAATAAI Ibl I I I I MTTAGATQGGCAGGGCACK7KiGCTCA(^CCTGTAATCCCAG 
CACTTTGGGAGGCCAAGGCCGGAGGA I IbU I CAGGCC^GGAGTTCAGGAGCAGCCTCGG 
CCACATTCCTGTCTCrAOXAAGAATAAAAAAGTT MCTGGGCATGGTGGCACATGCCTGT 
MTCCCAGCTACTCMC^GGCTT^GGAGGAGGA^ 
CAlCTTGAGCCTTO^TCACACCACTTJ^^ 

41457 CTAAGAATCAGGTTCTTCGCCAGGCAC^^ 
AGGCCTAGGTGGGAGGATCGGT^^ 

GAGACACTGTCTCrACAAAAAAAAAATAATAATAATAA I lb I I I \ I MTTAGATGQGCAG 

GGCACTUTOCCTIIACACCTC^ 

TGAGGCCAGGAGTTCAGGA(X!AGCCTGGG^ 

[G,C] 

TTMCTGGGC^TGGTCGCACATGCCTirrMTCCCAGCTACT 
ATKKZCTGAGCCCAGGAGTTCAAGACTGCAGK^GCCrTGATCAC^ 
CrrGGGCAACAGAGTXy\GACCTTCTXTCCAAAAAAAAAA G I rTCTTTTTTTI IATCCACT 
CTCCTCACCAMCAAACTGAGTAAGrrAGAGCCCTCTCAGCTGGCA I b I b I I QGAAACAG 
TGCCCTCrCATTAMGTCCTlGCCCT^CTCCCATTK 

43168 AGCTACTTGGGAGGCTC^GCX^^ 

AGCCGAGATCGrGCCATTGCACTTCAGCCTGGGCGACAGAGCGAGACrCTXTTCrC^^ 

TMTMTMTMGWAACTAGCCGGGCC^^ 

AGGAGGCGGAGGCATCAGACTCAGGTT^CTAG^ 

TCACACCAOGCACrC(^GCCraGTTX^O^GAGCGAGA 

[A.-.T] 

CCCATTTGCTCA I I I I I I GGATACrAGTATMCrATCACrcrAMCCAGTTAGTACTTAA 
ATCAAGCAGATATGGGAGATGGK^TTACCATCTACACj I b I I b I CATATATGTCACATA 
CTGAGCATTATCAGCTAGTAGMTCTA^ 

CCCAI lllbAAIblbllll I ACTATGCTTAMTAAATGACrGATGTCAGCMCCCCAAAA 
TGATACATCTGATCTAAGAGCCCCrGTTCCCCAATMTMCATCrAMCTATAGAC^^ 

43357 AQGCATGAGACTCAQGTGMCTAGQGAGACAGAGGTTGCAGTGAGCCM 

TGCACTCCAGCL I bb I I b ACAGAGOGAGACTCTCTCTCAAAAAAAAAAAMTCCCATTTC 
CTCAI 1 1 1 1 I QGATACTAGTATMCTATCACTCTAMCCAGTTAGTACTTAMTCAAGCA 
GATATGGGAGATCGTCAATTACCAT^ 

TATCAGCTAGTAGAATCT AGTTAA I lb I I CrATCTGTGATGTATGCAGAGTTCCCATTTT 
[T,G] 

AAI bib I I I I I ACTATGCTTAMTAMTGACKyVTTJrCAGCMCCCCAAMTGATA 
TGATXnMGAGCCCCTUTTCCCCMTMTMCATCrAMCTA^ 
GGTGCCCCTAAGTTTCCTCCCTCCAGGb 1 1 1 LI I GGCCGGTCTCTGAGGACTACACATCC 
CTACTCCCGTCTTTCCTCATCrrCAQGaXAGTMCAGTATCTCC^ 
AGCTCCCCAAAGGAGCCCC IbLlbl I CAGCCGTGACATCAGCCGCTCAGAATCCCTTCGT 

45664 CCAGCTTTCCTTGGCTTCCCCCA^ 
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MTCKXTCAAGTTCATTQGTTJnjC"! G I ACAAQGATAAGAAGCRiAACCTGCTGACAGAG 
TACATTGAQGGGGGCACACTGAAGGAL I I I L I CaQGCAGTATlGGTGAGCACACCACCCCAT 
AGTCTCCAGGAGCC I I GG I GQGTTGTGVGACACCTATGCTATC^CrACCCTAQGAGCTTA 
AAGGGCAGAGGGGCCL I GL I 1 1 GCCTCCAMGGACCATGCreGGTCGGACrGAGCATAG\ 
[T,C] 

AGGGAGGCTTCACnX^GACCACATTCACCCATOs^^ 
GCTCMCAGCCTCPGAAAATCATTCCCCATTC^ 
GGTCAGGTTTiGCCAMG(W\TC^ 
C^GGGCGAGAGTAQGGAGAQGTUreAGMTTTjnjGGCT 

TCCTATGCAACI I G I G I GGGCTGGCTX^GCAGCTATTCATnGAG I I IGILIGIGICACTC 

47549 MTTAGCTCXSGGJTCXnTXTITKAGK 

AGMTAGCTTGAACCTCXXjAGGCAGAAGT^ 
CAGCCreQGTl^CAGAGrrcAGACrTCATCrOWWWVAAAAA 
GTTACTACATTCGGGTCGMTCCGGAGGGTC(^GGGAATGGAGCCOr^TA 
ATGAAACATTTCAGA I 1 1 L I (^A^TTMGGTAGrTOGCTGTGGQGACAGGAGCCrcGGAGGC 
[A,C] 

GGGTOGAGTCAGAATCGAGAGA<_ I GG 1 1 GGCAATGAGQGAACAGGAGGAGGAQGAQGAQG 
AGTTAGGAGTCGCTTGAGGTCnX^CXrACC^^ 

I IGI rGAGCAACTGGl 1 1 GGGAAGAGCTAGCATTCATCCL I GL I G I I L I G I GCTAGCAGA 
ACCTATCAGCATL I I C I GGQG^GGAMCreGCTCCATGAGACTTKXTrAGGGAGAQGCTC 
CTAGTCACCrMTCTC<^Gtt 

47908 GGAGTTAGGAGTOKTT&AGCTr^ 

GATTGTTGAGCAALI GG I 1 I GGGAAGAGCTAGCATTCATCCL I GL I G 1 1 L I G tGCTAGCA 
GAACCTATCAGCATLI I e I GGGCAQGAMCTOGCrCCATGAGACrOGCTTAGGGAGAGQC 
TGCTACTCACCTMTCTGOVGAGAAGGGGCAGCTC 

ATGTAGCTG^TG^SGGGTGTCTCAGL I I G I GAAGAGGAGATQGCTTTGAGCAGQGCTGACA 
[C,A] 

TGAAMGGCTIGGMGAAAAAMCAGACAC^CMGAGTCTCAQGATCAGGTAG 
AG I I G I GGACAGTCTTTGAGGAGCACTCCCTCAGGCAGGC^^ 
ATAGCGATTCAG(W\(^GCrCCCTGGCT^ 
AGTAGTATTTXAGGGGGCTCGAGAGCMGGAGTGGCTC 
AGAAAGGAAGTTCXTCCTEAGAGTGGTAAGACT 

52267 TTGTGAGGGGTAGAGGAGAGGAGAGACAAGGGATGGTTAGGA^ I G I 1 1 IG 

I I I I IGI I I I IGI I I I I GAGATGGAGTTTCACTCTXJTC^CCCAGGCTTKaAGTCCAGAGGr 
GCMTCXTGGCTCACTCXAGCCTCCGCCTC^^ 

TCCCAAGTAQCK3QGACrAO\GGnrnXGCCACCAC!GCCTOGCrAA I 1 1 I IGIATTTTCA 

GTAGAGACAGQGTTTCGCCATATTCGCCAGQCTOGrcrCAAATK 

[C,A] 

CACCGGCTTCAGCCTCCCAAAGTCCTCAGATTACAGGCATGAGCr^ 

GAAGGAAGAI I IGI I I IAAAAAAI IGI I I ILI I IMTATTAATTGAACACCrcnGTTCAG 

AGCACTGGGC I GG I GCCAGAGGGTTTG^GACATGA^TCAGATCCAGCACCrCATAGAGCC 

TTMTCKXKACAG^GXCAGVGCC^CAAGGAG^CACAGACAAGGCAGGOT 

GAAGTTAGGAGCAGATGCTGATTTGGAAC^^ ILI I AG 

54654 GGCCCCGGCCCCGGCCCCCAGGCC^^ 

GAGCCCCCCCCGGGGCGGCAGG^GAGGGCCCGGGCTGGGCGGATCATCAGGGC 

GGCGCCMGGGAAGGTCACCATCMGTATO\CCCCMGGAGCTACGGAAGCACCrCAACC 

TAGAGGAGTGGATCCreGAGCAGCTCACGCGCCrcrACGACTGCCAGGMGAGG^ 

CAGMCTAGAGATTGACGTGGATGAGCTCCTO^ 

[T,C] 

CAGGGTCAAQGAGCTGCTGGTTGACTXrrTACAMCCCACAGAGK 
GCTGGA<^AGATCGGGGCCATGCAGAAGCTGAGCAC^^ 

GACCCAGGCGflACGGTGGCTCCCATAGGACAATCGCTACCCCCCGACCTCGTAGCAAG\G 
CAATACCGGQQGACCCreCQGCCAGGCCroGrrCCATG^ 

GCCC^GGGGTCTCrTCCCCTGCCCCCTCAGTTTTCCAL I I I IGGAI I I 1 1 1 IATTCTTAT 
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54679 GGCAGTGGCGGCCMGGACCACGCATCTACTTTCAGAGCCCCCCCG^^ 

GGGCCCGGGCTCGGCGGAT^TCAGQGCCCAGreAGGCGCCAAGGGMGCT 

GTATC^CCCOV\GGAGCTACGGM^ 

CACGCXSCCTCTAO^CTGCGIVGGAAGAGGAGAT^^ 

GCTCCTCGACATGGAGAGTGACGA^ 

[C,G] 

tctitacamcccacagaggccttcat^^ 
mgctgagcacaccccagaagaagtcagggtccccgaccca^ 
aggacaatcqctacccccggaccrogtagcaacagcaatacgqqqqgacccrgcqgccag 
gcctcgttcgotgaggagggctcctan^ 

ctcagttttccag i i i pqga1 i 1 1 i i iai pg1 rattaaactgatcggacl i iugi i i i i 

54693 aggaccacgcatctactttcagagc^^ 

gggatcatgagggcccagtga,ggcgccaaggga^ggt caccatcaagtatg^ccccaaqg 
agct acggaagc^cctcmcctag^qgagtgg^tccrggagc^gctcacgcgccrcracg 
actxxeaggaagaggagatctcagmctagagatt^^ 

AGAGreACGATGCCPQQGCTTCCAGGGTCOAQGAGC. \UL\Uj\ I (jACTGTTACAAACCGA 
[A,C] 

AGAGKCTTCATCTCTGGCCTCCTGGACAAGATCCGG 

CCAGAAGAACHGAGGCTCCCCGACCCAG 

CCCCCGACCTCGTAGGAACAGCMTACCGGGGGAC^^ 

GCAGGGCTCCTCGTGCCCCreGCCCAGGGGTCrCTTCCCCrXKICCCCT 

TTTGGAI I I II I I ATTGTTATTAMCTGATQQGAC I ITCTCSm I IATATTGACTC7GCG 

54706 TACTTTCAGAGCCCCCCCCGGGGCCGGAGGAGAGGGCCGGGGCTGG^ 
CCGAGTGAGGCGCCAAGGGAAGGTCACCATCAAGTATGACCCCAAG 
CCTCMCCTAG^GGAGTGGATCCTGGA^ 
GGAGATCTCAGMCrAGAGATTCACGTGGATGAGCTCCTGGA 
CTGGGCTTCCAGGGTCAAGGAGL IUJUjI IGACTCTTACAAACCCACAGAGGCCTTGAT 
[T,C] 

TCTGGCCTGCTCGACAAG^TCCGGGCCATCCAGAAGCT^ 

GGGTCCCCGACCCAGGCGAACGGTCGCTCCCATAGGACMTCGCTACCCCCCGACCTCGT 
AGCMCAGCMTACCGGGGGACCCTCCGGCCAGGCLIUjI ICGATCAGGAGG GCTCCTCG 
TGCCCCreGCCCAGGGGTCTCTTCCCCTGCCCCCTCAGTTTTCCACI 1 1 IGGAI I 1 1 I I I 
ATTGTTATTAMCTGATGGGACTTTGTbl I I I I ATAT7GACTC7GCGGCACGGGCCCTTT 

54712 CAGAGGCCCCCCCGGQQCGGCAGGAGAGQGCCCGGQCreGGGQGATCATGAGQGGCCA 

GAQGCGCCAAGGGAAGGTCACCATCAAGTATGACCCCAAGGAGGTACGGAAGCACCTCAA 
CCTAGAGGAGTOGATCCTGGAG<AGCTCACGGGCCTCrA^ 
CTCAGAACTAGAGATTCACGTGGATGAGCTCCTCGACATGGAGACT 
TTCCAGGGTCAAGGAGCTGG I GG 1 1 GACTCTTACAAACCCACAGAGGCCTTCATCTCTCG 
[T,C] 

CTGCTGGACAAGATCCGGGCCATGCAGMGCTGAGCACACCCCAGAAGMGTX^QGGTCC 
CCGACCGAGGCGMGGGTGGCTCCCATAGGACAATCGCTACCCCCCXaACCTCXJTAGCAAC 
AGCAATACCGGGGGACCCTGCGGCCAGGCL I Gts I I CCATGAGCAGQGCTCCTCGTGCCCC 
TGGCCCAGGGGTCTCTTCCCCTGCCCCCTCAGTTTTCCAL I I I IGGAI I I I I I IATTGTT 
ATTAAACTGATGGGAL I mSTGTTTI IATATTGACTCTGCGGCACGGGCCCTTTAATAAA 

54799 GTATGACCCCMGGAGCrACGGMGCACCTCMCCTAGAGGAGTG^ 

CACGCGCCrCTACGACTGCCAGGAAGAGGAGATCTCAGMCrAGAGATTGACGTGGATGA 

GCTCCTGGACATQGAGAGTGACX^TGCCTCGGCrTCCAGGGT 

CTGTTAGAMCCCACAGAGGCCTTCATCTCTGGCCTGCrGGACAAGATCCGGGCCA^ 

GAAGCl^GCACACCCCAGAAGMGTCAGGGTCCCCGACCCAGGCGMCGGTGGCTCCCA 

[T,C] 

AGGACMTCGCTACCCCCCGACCTCGTAGGMCAGCAATACCGGGGGACCCTGCGGCCAG 
GCCTGGTTCCATGAGCAGGGCTGCTCGTGCCCCTGGCCCAGGGGTCT 
CTCAGTTTTCCAG I I I IGGAI I I I I I IAI IGI I ATTAAACTGATGGGAG I I IGIGI I I I I 
ATATTCACTCTGCGGCACGGGCCCITrMTAMGCGAGGTAGGCTA 
CTCAAAAAAAAAAAAAAAMTGATTTCCAGCGGTCCACATTAGAGTTGA 
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54819 GGMGCACCTGWCTAGAGGAGra 

AGGAAGAGGAGATCTCAGAACTAGAGATTOV^^ 

ACGATCCCreGGCTTCCAGGGTCAAGGAGL PSCTCG1 I U\CTUTTAGW\CCCACAGAGG 
CCTTCATCTCTGGCCTGC' I GGACAAGATCCGGGCCATCCAGAAGCT^\GCACACCCCAGA 
AGAAGTCAGGGTCCCGGACCCAGGCGMCGGTCGCTCC^ 
[G,A] 

ACCTCGTAGCMCAGCAATACGGGGGGACCCTGCGGCCAGGCL I GG 1 1 CCATGAGCAGGG 

CTCCTCGTGCCCCTGGCCCAGGGGTCTCTTCCCCTGCCCCCTCAGT^ 

1 1 1 1 1 1 1 ATTCTTATTAAACTCATGGGAC 1 1 IGIGI 1 1 1 IATATTGACTCTGGGGCACGG 

GCCCTTTMTAAAGCGAGGTAGGGTACGCCTTTGGTGCAGCTCAAAAAA 

TCATTTCCAGGGGTCCACATTAGAGTTGAAA I I I I L I GGTGGGAGAATCTATACL I IGI I 

55499 TTGTTTTCTMTACCTCrTGTCATTCTAMTATCTTTM 
ACAGTATTCAATGCCTAaGTGTGCrAGGTACAGTT 
MCAAWAAAGGTXXTTACCCTCATAGAACATAGATTCTA 
ATACAGTAGATACMTAAGTAAACTATATTCAATATTAGMTC 
AAAGAGTCAAGACMGTAAAGACGATTGTTCAGG^ 
[C,T] 

GTCAGAGCAGGCCTCACTGAGGTGAGATCACATTT^ 

TAAGCCTCAGC I G I CI I AGGGTTGCGGGGCAGCCMGCCATrTTCGGTGGCACTAGGAGCC 
TGGTGTTTCCGATTCCACCTTTGATMCTGCATTTTCrCT 

TTCTCCTATTb I I I I I AAGTATFAAGTCGAGCTAGTCCAGCC I I G 1 1 ATAGTGTTACCTA 
ATCTTTATAGCAMTATATGAGGTACGGGTMCATTA^ 

56825 ACTCATCGCTCAAAGGGTGTCAAAAAGTCAGTCA 

GTCCTTCCTGGAGCAAGGT7GAGGGAGTAGG 1 1 I I GAAGAGTCCCTTAATATGTGGTGGA 

ACAGGCCAGGAGTTAGAGAAAGGGCTGGC I I Ufa 1 1 1 ACCTGCTCACTCGCTCTAGCCAG 

CCCAGGGACCACATCAATGTGAGAGGAAGCCTCCACCTCAIGI 1 1 ICAAACTTAATACTG 

GAGACTGGCTCAGAACTTACGGACAACATCLI I I U b I (J (jAAACAAACAGTCACAAGCA 

[C,A] • ! v , •:, 

AGGAAGAGGCTGGGGGACTAGAAAGAGGCCOGCCCTCTAGAAA^ 

OGTTACTCATACTCGGGTGGGCrCCTrAGTCAGATGCCrA^ 

CGATQGGI I L I GGAGGTXCAGTGTGGCTTGTCACAGGCGTAGAGTCTG^GGGAGGGGA&TG 

GGAGTCTCAGCAATCTCrTGGTCrrGGCrrCATGGCAAC 

CCIGGI I I AGGCAGCAGCnGGGCTGGGMGAGGTGGTGGCAGAGTCTCAAAGCTCAGA 

58871 CGTCACCCttCCACCCMCCCC^ 

CATTTTMCTACCTCCACCTTGGAMCMTTCC^ 

ACCACU IGI IGGGAGGCCTGCACACLIGILI I ICCIGU I CAACCTGAAAGATTCCTGA 

TCATGATMTCTGGACACAGAAGCGGGGCACGGTQGCTCTAGCCTGTMT 

TGGGAGGCCTCAGCAGGTGGATCACCTX^GATCAAGAGTTTGAGM 

[T,A] 

GGTCAMCCCGGTCTCTACrAAAMTACAAAMTTAGCCAGGTGTGGTGGCACAT^ 

TMTCCCAGCTACTCTGGAGGCTGAGGCAGGAGMTCGCTrGAACCCACAAGGCAGAGCT 

TGCAGTGAGGGGAGATCATGCCATTGCACTCCAGCCTGTGCAAC^ 

CTCAAAAAAAA 



FIGURE 3-30 



